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Tetracycline assay method. 



FIELD OF THE INVENTION 

This invention relates to a method for the determination of a tetracycline in a 
5 sample. The invention also concerns recombinant prokaryotic cells capable of 

emitting light in response to the existence of a tetracycline in a sample. Furthermore, 
the invention relates to novel DNA vectors useful for the construction of said 
prokaryotic cells. 

10 BACKGROUND OF THE INVENTION 

The publications and other materials used herein to illuminate the background of the 
invention, and in particular, cases to provide additional details respecting the 
practice, are incorporated by reference. 

15 Whole cells can be used in methods based on the use of living cells or organisms as 
sensor tools of detection. Many of these methods utilize bacterial or yeast cells. 
Prokaryotic organisms and especially Escherichia coli bacterium are very well 
characterized and maps of genes and their sequences at nucleotide level are known. 
Therefore the behavior of the whole cell sensor can be better understood. Because 

20 of this fact it is also possible to develop analyte or group specific sensors utilizing 
different regulatory regions of genomes and also various microbial strains. 

Whole cells can be utilized in biosensors which are devices consisting of 1) a 
sensor, 2) a recording unit and 3) a possible connector such as fiber optic guide 
25 between 1 and 2. The recording unit has several choices of what is the physical 
background of the measurement. It can be change in heat, conductance, color 
reaction, changes in fluorescent properties, emission of endogenous light from the 
sensor cells etc. 
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Antibiotics used as medicines against microbial invasion are detected from body 
fluids in order to study the dosage and penetration of the medicine. Often the 
effective therapeutic range of the antibiotic is rather narrow and the risks of 
overdosage might be too big. It is also important to measure the presence or 
concentration of antibiotics from meat and milk due to syndrome of allergic people. 
In the course of cheese production milk used as starting material should not contain 
antibiotics due to the fact that cheesemaking bacteria are not able to work on 
contaminated milk. 

Conventional tests for the measurement of toxic substances such as antimicrobial 
agents (antibiotics) are based on the inhibition of growth. Growth inhibition can be 
followed by monitoring the zone where the growth of microbes is inhibited on a 
nutrient agar plate around a disk onto which an antibiotic dilution was pipetted. 
Typical examples of agar diffusion tests are cylindrical, hole or disk methods. The 
difference in these tests is only restricted in the way the sample is applied on the 
agar and also the way the bacteria in the test is used. Another means is to follow the 
metabolism of the test organisms by estimating the intensity of a color reaction 
which is affected by the inhibitory antibiotic present and comparing it to the 
uninhibited control (e.g. the commercial products: Delvo Test™, Brilliant black- 
reduction test, Charm Farm Test, Charm AIM-96 and Valio TlOl-test). Since 
microbiological methods utilize bacteria or their spores it is the sensitivity of the test 
bacteria which is of utmost importance. Thus far one had to make compromises in 
the choice of a suitable test strain since great sensitivity against antimicrobial agents 
5 and other characteristics needed for the test strain have not been common features 
for the same strain of bacteria. A major drawback when using microbes in antibiotic 
residue tests is slow and unsensitive performance. Since in these methods one 
always controls in a way or other the growth of the tester strain one cannot imagine 
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the test to be performed in an hour. This is due to the fact that the growth of the 
microbe is a slow phenomenon even at its fastest mode. Also in many cases 
microbes are in spores or freeze-dried, the regeneration of which makes the tests 
even more slow to perform. 

OBJECT AND SUMMARY OF THE INVENTION 
The object of the invention is to provide a novel method of determining a 
tertracycline in a sample where said method is rapid and selective for tetracyclines, 
i.e. the method is able to distinguish tetracyclines from other antimicrobial agents. 



10 



According to one aspect of the invention a method for the determination of a 
tetracycline in a sample is provided, wherein the method is characterized in that 

- the sample is brought into contact with prokaryotic cells encompassing a DNA 
vector including a nucleotide sequence encoding a light producing enzyme under 

15 transcriptional control of a tetracycline repressor and a tetracycline promoter, 

- detecting the luminescense emitted from the cells, and 

- comparing the emitted luminescence to the luminescence emitted from cells in a 
control containing no tetracycline 

- wherein a detectable luminescence higher than a luminescence of the control 
20 indicates the presence of tetracycline in the sample. 

According to another aspect, the invention concerns a recombinant prokaryotic cell 
which encompasses a DNA vector including a nucleotide sequence encoding a light 
producing enzyme, tetracycline repressor and tetracycline promoter. 



25 



According to yet another aspect, the invention concerns a plasmid which comprises 
either 
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_ the luxCDABE genes (SEQ ID NO: 3), tetracycline repressor (TetR) (SEQ ID 
NO: 1 1) and tetracycline promoter (TetA) (SEQ ID NO: 9) from TnlO, or 
- the insect luciferase gene (SEQ ID NO: 1), tetracycline repressor (TetR) (SEQ 
ID NO: 1 1) and tetracycline promoter (TetA) (SEQ ID NO: 9) from TnlO. 

5 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure la shows schematically the method according to this invention, where cells 
cloned with the plasmid pTetLuxl (SEQ ID NO: 3) are used. 

10 Figure lb shows schematically the method according to this invention, where ceils 
cloned with the plasmid pTetLucl (SEQ ID NO: 1) are used. 

Figure lc shows schematically the production of the luciferase enzyme, 



15 Figure 2 shows the plasmid pTetLuxl (SEQ ID NO: 3). 
Figure 3 shows the plasmid pTetLucl (SEQ ID NO: 1). 

Figure 4a shows the production of light (induction factor) versus concentration ■ 
20 tetracycline in samples for three different tetracyclines, 

Figure 4b shows the production of light (induction factor) versus concentration 
tetracycline in samples for further four different tetracyclines. 



25 Figure 5 shows the effect of magnesium ions on the sensitivity of 
according to the invention. 
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Figure 6 illustrates possibilities of changing the assay window for the method of the 
invention by adjusting magnesium ion concentration and pH. 

Figure 7 shows the induction factor versus tetracycline concentration when using 
5 freeze-dried E. coli in the determination of tetracycline. 

Figure 8 shows a comparison of the assays based on using cells with the plasmid 
pTetLucl (SEQ ID NO: 1) and with the plasmid pTetLuxl (SEQ ID NO: 3). 

10 Figure 9 shows induction factors versus antibiotic concentrations of a pig serum 
sample (cells E. coli K12, pTetLuxl). 

Figure 10 shows the effect of EDTA in a milk sample assay, and 

15 Figure 1 1 shows the light emission versus time for an assay according to the 
invention. 

DETAILED DESCRIPTION OF THE INVENTION 

The term "tetracycline" shall be understood to include any compound covered by the 
20 general structure formula 




^1 ^ 


6 I 


^ 5 ^ 














,OH 




.\ 10 




11 , 


<o 2 ^ 


^ 1 






CONK 



OH 



WO 99/25866 



PCT/FI98/00873 



and particularly the specific commercially available compounds listed in the table 
below. 



GENERIC NAME 
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Furthermore, the term "tetracycline" shall be understood to cover the metabolic and 
other reformulation/decomposition products thereof. 



The cells useful in the method of the invention are preferably Escherichia coli, 
which are stored in dried form, e.g. in lyophilized form before their use in the 
method according to the invention. Also freshly cultivated cells can be used. 

10 According to a preferred embodiment, the DNA vector including a nucleotide 

sequence encoding a light producing enzyme is a plasmid containing the luxCDABE 
genes (SEQ ID NO: 3), tetracycline repressor (TetR) (SEQ ID NO: 1 1) and 
tetracycline promoter (TetA) (SEQ ID NO: 9) from transposon TnlO. Particularly 
preferable is the plasmid pTetLuxl (SEQ ID NO: 3). 



15 



According to another preferred embodiment, the DNA vector including a nucleotide 
sequence encoding a light producing enzyme is a plasmid containing the insect 



WO 99/25866 



PCT/FI98/00873 



luciferase gene, tetracycline repressor (TetR) (SEQ ID NO: 1 1) and tetracycline 
promoter (TetA) (SEQ ID NO: 9) from TnlO. In this case the substrate for insect 
luciferase reaction, D-luciferin, is added to the mixture of the sample and the cells in 
order to initiate the luminescence of the cells. The plasmid is preferably pTetLucl 
5 (SEQ ID NO: 1). 

The method according to this invention is useful for the determination of 
tetracycline in various kinds of samples. As examples can be mentioned milk, fish, 
meat, infant formula, eggs, honey, vegetables, serum, plasma, whole blood or the 
10 like. 

The luminescence of the cells is preferably measured using an X-ray or polaroid 
film, a CCD-caLra (Charge Coupled Device), a liquid scintillation counter or, 
' most preferably, a^luminometer. 
15 

The sensitivity of this analysis method with respect to the tetracycline can be 
controlled by increasing or decreasing the concentration of divalent metal ions, e.g. 
magnesium ions, in the mixture of the sample and the cells, by adjusting the pH or 
by combined adjusting of the divalent metal ion concentration and the pH. 
20 Increasing concentration of magnesium ions decreases the sensitivity and vice versa. 
Increasing P H will also cause a decreasing sensitivity. The sensitivity of the analysis 
with respect to the tetracycline can be increased by the use of cells which are 
especially antibiotic sensitive mutant strains. Chelating agents such as EDTA can be 
added to further sensitize the sensor system for tetracyclines. 



25 



Figures 1 show a schematic representation of a method based on specific detection 
of the presence of tetracyclines using microbial cells cloned with either the plasmid 
pTetLuxl (SEQ ID NO: 3) (Figure la) or with the plasmid pTetLucl (SEQ ID 
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NO: 1) (Figure lb). The figures show that cells containing either of the plasmids 
can be triggered to produce light by adding a chemical agent (a tetracycline). Light 
production is a consequence of tetracycline responsive promoter activation due to 
removal of the tet-repressor protein (SEQ ID NO: 1 1) leading to the production of 
5 luciferase specific mRNA and luciferase protein (SEQ ID NO: 2, 4-8) itself. The 
principle is demonstrated in Figure lc. In case of the usage of full length bacterial 
luciferase operon (SEQ ID NO: 3) containing luxC, luxD, luxA, luxB and luxE 
genes (SEQ ID NO: 3) (Figure la), one is able to get light emission without addition 
of any substance. In case of insect (e.g. firefly) luciferase (SEQ ID NO: 2) (Figure 
10 lb), light is emitted only after the addition of D-luciferin. It should be noticed that 
the triggering of luciferase synthesis and light production commences immediately 
when the cells are introduced to the inducer molecules (tetracyclines). Therefore 
there is no need to use dividing cells and hence there is no need to use long 
cultivation of microbial cells such as the case is with conventional methods. 
15 Therefore, if needed, one can get results in minutes rather than in hours or days 
which is the case when conventional methods are used. 

Figure 2a shows the plasmid pTetLuxl (SEQ ID NO: 3), in which the production of 
bacterial luciferase (SEQ ID NO: 4-8) of Photorhabdus luminescens (formerly 
20 Xenorhabdus luminescens; the lux-operon structure and the full-length nucleotide 
sequence of P. luminescens was published in Szittner, R. and Meighen, E. (1990) J. 
Biol. Chem. 265, 16581-16587) can be switched on by the addition of a chemical 
agent belonging to the tetracycline family of antimicrobial agents in a cloned E. coli 
bacterium. SEQ ID NO: 3 shows the nucleotide sequence of the plasmid pTetLuxl. 
25 This plasmid construct is devised to contain the five genes from P. luminescens 
luciferase operon necessary for the light production without any additions of 
substrates, i.e. cells cloned with such a construct produce substrates endogenously. 
By incubating £. coli cells containing this plasmid (or any other microbial strain 
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whereto similar regulation/reporter gene system is incorporated containing the 
necessary secondary regulatory sequences in the constructs such as correct ribosome 
binding region, transcriptional termination etc.) in the presence of very small 
amounts of tetracyclines one is able to obtain light production the intensity of which 
5 is proportional to the concentration of tetracycline used. 

Any E. coli mutant strain and especially those strains having a mutation in the 
export/import machinery of the membranes or otherwise leaky character making it 
possible for large molecules to easily penetrate inside the cell would be beneficial to 

10 use in the method described in this invention. Also other gram-negative bacteria 
such as strains belonging to genus Salmonella, Shigella, Enter obacter, Citrobacter, 
Klebsiella, Erwinia, Pseudomonas, Serratia as well as gram-positive organisms 
such as those belonging to genus Bacillus (especially B. subtilis, B. licheniformis, 
B. pumilus, B. globigii, B. natto, B. amyloliquefaciens as well as B. niger, B. brevis, 

15 B. megaterium ), Streptomyces, Lactobacillus (especially L. lactis, L casei ) and 
Streptococcus (especially S. thermophilus, S. cremoris, S. agalactiae ) come into 
question. Especially asporogenic strains of Bacilli or Lactobacilli are suitable. 

Figure 3 shows the plasmid pTetLucl (SEQ ID NO: 1), in which the production of 
20 firefly luciferase (SEQ ID NO: 2) of Photinus pyralis (The gene encoding firefly 
luciferase was originally cloned and sequenced in the middle of the 1980' s by 
DeWet, J. et al. (1987) Mol. Cell. Biol. 7, 725-737) can be switched on by the 
addition of a chemical agent belonging to the tetracycline family of antimicrobial 
agents in a cloned E. coli bacterium. SEQ ID NO: 1 shows the nucleotide sequence 
25 of this plasmid. By incubating E. coli cells containing this plasmid (or any other 
microbial strain whereto similar regulation/reporter gene system is incorporated 
containing the necessary secondary regulatory sequences in the constructs such as 
correct ribosome binding region, transcriptional termination etc.) in the presence of 
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very small amounts of tetracyclines one is able to obtain light production by the 
addition of D-lucif erin, which is the substrate of firefly luciferase. The intensity of 
light emission is proportional to the concentration of tetracycline used. 

5 Figures 4a and 4b shows the effect of altogether seven different tetracyclines on the 
production of light as a function of concentration of each tetracycline. As controls 
different non-tetracycline antibiotics were included in this study to show that the 
sensor strain is specific for the tetracyclines. The luminescense was emitted from 
E. coli containing the plasmid pTetLuxl (SEQ ID NO: 3). The detection was made 

10 after an incubation of 90 min. All tetracyclines tested behaved in a very similar 
manner and induction efficiencies were at the same antibiotic concentration area. 
This makes this sensor even more attractive for analytical use for the determination 
of the tetracycline group of antibiotics. 

15 It should be noted that the accumulation of various tetracyclines into microbial cells 
is very strongly affected by the extracellular concentration of Mg 2+ ions. Figure 5 
shows the effect of increasing concentrations of Mg 2+ ions on the behavior of K coli 
cells containing the plasmid pTetLuxl (SEQ ID NO: 3). As can be seen the 
tetracycline response curve is shifted to the right as a function of increasing 
20 concentrations of added Mg 2+ ions. Thus by increasing the Mg 2+ ion concentration 
one is able to decrease the sensitivity of the tetracycline sensor described in this 
invention. This fact is of great importance in cases where one does not need a high 
sensitivity of the measurement and where the approximate concentration of the ion 
is roughly constant and known such as in milk, serum and plasma. 



25 



The sensitivity can be increased by removing magnesium ions from the assay 
mixture e.g. by adding a chelating agent forming a complex with magnesium. 
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Figure 6 shows the possibilities to change the assay window for tetracyclines by 
adjusting the magnesium ion concentri 
magnesium ion concentration and pH. 



adjusting the magnesium ion concentration and by combined adjustment of the 



5 The sensitivity of the assay can be increased by the use of cells which are especially 
antibiotic sensitive mutant strains. Hundreds of specific mutations for bacteria are 
known with which it is possible to study the activity of specific reactions. For 
instance trace amounts of antibiotics cause visible changes in the metabolism or in 
the cell membranes of antibiotic sensitive bacterial mutants. Mutations in cell wall 
10 structural components or biosynthetic enzymes as well as in transport and efflux 
proteins such as porins might have an effect on the behavior of each sensor. Using 
these kinds of mutations one is able to develop tests measuring residual antibiotics 
from biological material very sensitively. It is also rather simple to transfer new 
characteristics into bacterial cells by genetic engineering techniques. This 
15 phenomenon broadens the applicability of these organisms in tests utilizing whole 
cell sensor. 

Measurement ofeht emission can be done by using X-ray or polaroid film, using a 
. 3V liquid scintillationNcounter, a CCD-camera or a luminomete, The CCD-camera is an 
'C,* 20 instrumentwMchiVpableofdetectingverylowlevelsoflight.IntheapplicaUons 
of this invention sucn\kind of a device could be used for the detection of tetracycline 
residues in food material such as vegetables or meat. The detection of light emission 
could be directly monitored from the surface of the food material sprayed with 
engineered luminescent baWria. Either chemiluminescent (such as peroxidase - 
25 luminol) or biolurninescentLch as luciferase - luciferin) reactions can be utilized. 
The luminometric method is pVformed with the aid of genes encoding either 
bacterial or beetle luciferases su^h as those described in the Figures 2 and 4. 
Several luminescent bacterial specks such as V. harveyi, V.fischeri, P. leiognathi, 
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P. phosphoreum, Xenor\abdus luminescens etc. exist. Luminescent beetles are for 
example Luciola mingrelka, Photinus pyralis, Pyrophorus plagiophthalamus, 
Lampyris noctiluca, Phola\dactylus. etc. Also several eukaryotic species in the sea 
which luminesce, such as maWe ostracod Vargula hilgendorfii, jellyfish Aequorea 
victoria, batrachoidid fish Pohchtys notatus, pempherid fish Parapriacanthus 
ransonneti etc. exist. Fluorescent reporter proteins such as green fluorescent protein 
(GFP) or any of its variants coulee used in the methods described in this invention 
(Li, X. et al. (1997) J. Biol. ChemNp, 28545-28549). 

In this invention high detection sensitivity of the luminescent enzyme labels inside a 
living cell associated with tetracycline-specific induction of label synthesis is based 
on the use of optimal concentration of all the reactants inside the cell including the 
necessary cofactors and accessory enzymes. All luciferase genes from these 
organisms would presumably work in a similar manner as the two examples shown 
15 in this invention. These systems together with enhancers and modulators 

(wavelength, emission kinetics etc.) of light emission has been described in more 
detail in Campbell, A. "Chemiluminescence; principles and applications in biology 
and medicine", Weinheim; Deerfield Beach, FL; VCH; Chichester: Horwood, 1988. 

20 Peroxidases or oxidases can be used together with compounds such as luminol or 
acridines (for instance lucigenin) to yield luminescent signals suitable for a 
detection system described here. Enzymatically generated chemiluminescence offers 
great sensitivity and rapid detection, too, in assays described in this invention. 
Thermally stable dioxetanes (such as AMPPD and Lumigen PPD) can be 

25 enzymatically (such as alkaline phosphatase or (3-galactosidase) triggered to produce 
chemiluminescence (Schaap, A.P. et al. (1989) Clin. Chem. 35, 1863-1864). The 
only difference to the luciferase enzymes would be that these enzymes are capable 
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of cleaving a man-made substrate which gives light emission (chemiluminescence) 
and the luciferases cleave natural substrates to produce light (bioluminescence). 

Tetracycline-controlled expression systems are developed to express heterologous 
proteins in procaryotic and eucaryotic cells for the purpose of production under a 
tight control of tet-regulatory system ( Skerra, A. (1994) Gene 151, 131-135; 
Gossen, M. and Bujard, H. (1995) US Patent 5,464,758 ; Lutz, R. and Bujard, H. 
(1997) Nucleic Acids Res. 25, 1203-1210). 

A method to study various tetracyclines and their mode of action was developed by 
Chopra et al. (Chopra, I. et al. (1990) Antimicrob. Agents Chemother. 34, 111-116) 
The assay system developed in this study was based on expression of 
p-galactosidase gene inserted under the control of tetA-gene. The method resulted in 
less sensitive detection of tetracyclines compared to the invention described here. 
However in order to obtain maximum sensitivities Chopra et al. showed that it was 
necessary to add cyclic AMP (cAMP) to the medium which is an extremely 
expensive molecule to be used in routine applications. Furthermore, the method 
described by Chopra et al. contains a cell disruption stage by sonication in order to 
assay for the reporter gene activity, (3-galactosidase, which step is not practical. 
Instead, the method described in this invention does not contain any cell disruption. 
The activity of luciferase can be measured directly from living cells in real-time and 
in the case of pTetLuxl (SEQ ID NO: 3) there is no need of addition of any 
substrates. Therefore, promoter activation due to the presense/absense of 
tetracycline can be monitored continuously. 

EXPERIMENTS 

As cloning hosts and in antibiotic residue measurements various E. coli MCI 061 
(cl+, araD139, A(ara-teu)7696, ZacX74, galU, galK, hsr, hsm, strA) (Casadaban, 
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M.J. and Cohen, S.N. (1980) J. Mol. Biol. 138, 179-207), BW322 (CGSC, 
r/a210::Tnl0, thi-l, relAl, spoThpyrE) andK-12 (M72 Sm R lacZm-AbiouvrB, 
trpEAl, jVam7iVam53cI857 HI) (Remaut, E. et al. (1981) Gene 15, 81-93) can be 
used. Especially the strain LH530 (Hirvas, L. et al. (1997) Microbiology 143, 73-81) 
5 which has a decreased rate of lipid A biosynthesis. It has proven to be 
hypersusceptible to many different antibiotics. 

Cells were grown on appropriate minimal agar-plates and were kept maximally one 
month at +4 °C after which new plates were stroked. The strains were kept also in 
10 15% glycerol at -70 °C, where from growth was started through minimal plates. The 
cells were first cultivated in 5 ml of 2xTY medium (16 g Bacto tryptone, 8 g Yeast 
extract, 8 g NaCl, H 2 0 ad 1 1, pH 7.4, with appropriate antibiotic) 10 h at 30 °C in a 
shaker after which the cultivation was transferred to a bigger volume for 10 h with 
same medium. 

15 

Construction of tetracycline-responsive sensor plasmids: 

To construct a recombinant DNA vector carrying luciferase genes under the control 
of a tetracycline responsive elements two new vectors were created. In the first one 
modified firefly luciferase gene (SEQ ID NO: 1) from vector pBLuc* (Bonin, A.L. 
20 et al. (1994) Gene 141, 75-77) was excised by using restriction enzymesX&al and 
HmDIII and the 1.7 kb fragment was isolated from LGT-agarose gel and purified 
using Qiagen gel extraction kit. This DNA-fragment containing the entire Photinus 
pyrllis luciferase gene (SEQ ID NO: 1) was ligated using T4-DNA-ligase enzyme 
to vector P ASK75 (Skerra, A. (1994) Gene 151, 131-135) which was previously 
25 restricted with the same restriction enzymes Xbal and HmDIII and calf intestinal 
phosphatase treated to remove the protruding phosphate groups in order to prevent 
self-ligation. The resulting ligation mixture was incubated 3 hours at room 
temperature after which one |il of the mixture was electroporated according to 
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Dower et al (Dower, W.J. et al. (1988) Nucleic Acids Res. 16, 6126-6144) into 
electrocompetent E. coli MC1061 cells. A plasmid was extracted from one of the 
colonies obtained and checked for the estimated structure by appropriate restriction 
enzyme digestions and agarose gel electrophoretic techniques. The plasmid obtained 
5 was named as pTetLucl (SEQ ID NO: 1). 

The plasmid containing the luxCDABE genes (SEQ ID NO: 3) of Photorhabdus 
luminescens under the control of tetracycline responsive element was created as 
follows: Plasmid pASK75 was cut with restriction enzyme EcoRL and CIP-treated. 
10 The linearized plasmid was separated on a LGT-agarose gel electrophoresis and the 
agarose was removed by using the Qiagen kit. The lux operon was excised with 
EcoRI from plasmid pCGLS-1 1 (Frackman, S. et al. (1990) J. Bacteriol. 172, 5767- 
5773), gel purified as above and ligated to pASK75 by using T4-DNA-ligase at 16 
°C overnight. The ligation mixture was electroporated into E. coli MC1061 cells as 
15 described above and correct transformants were screened for their ability to produce 
light (as measured with a BioOrbit 1250 manual luminometer) which production 
was increased in the presence of 1 Hg/ml of tetracycline-HCl. The plasmid was 
further verified by restriction enzyme digestions and the correct structure was 
named as pTetLuxl (SEQ ID NO: 3). All the DNA-manipulations were performed 
20 according to Sambrook et al, "Molecular Cloning: A laboratory Manual, Cold 
Spring Harbor Laboratory Press: Cold Spring Harbor, NY, 1989. 

The vector pASK75 was utilized in the construction of tet-sensor plasmids shown in 
this invention. The vector pASK75 was originally developed for protein production 
25 and purification purposes. It contains a signal sequence for secretion of the 

recombinant protein into the periplasmic space of E. coli. Also a C-terminal fusion 
between a purification tail, the Strept-tag, was incorporated into the vector to 
facilitate purification of recombinant protein using streptavidin affinity agarose gel 
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chromatography. The element controlling recombinant gene expression in the vector 
is tetA promoter/operator system that allows efficient regulation of the expression, 
which in Skerra's paper was described for the production and one-step purification 
of a murine single-chain antibody fragment. The tetA promoter/operator (SEQ ID 
NO: 9) is controlled by tetR-repressor (SEQ ID NO: 9) which is produced by the 
corresponding gene (SEQ ID NO: 9). Some of the above mentioned elements were 
eliminated from the present plasmids due to unnecessary features with respect to this 



invention. 

10 Transfer of the tetracycline sensor vectors to the antibiotic sensitive E. coli 
strain: 

Either pTetLuxl (SEQ ID NO: 3) or pTetLucl (SEQ ID NO: 1) was transformed 
into E coli LH530 cells by electroporation as described above. The transformed 
cells were restreaked on agar plates and kept maximally for 2 weeks at +4 °C after 
15 which a new plate was streaked. 

Use of the manipulated E. coli in tetracycline determination methods: 

Example 1 

Freeze-dried E. coli K-12/pTetLuxl were reconstituted with 1 .0 ml of L-broth and 
20 bacteria were diluted 1:10 with 25 mM MES buffer in M9 minimal medium, P H 6.0. 
190 nl bacterial suspension was added to microtiter plate wells containing 10 \i\ of 
tetracycline dilutions. The plate was incubated 90 minutes at 37 °C after which the 
plate was measured with Labsystems Luminoskan luminometer. As seen from 
Figure 7 the sensitivity of the assay of tetracycline is very high and comparable to 
25 that of fresh cells. 
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Example 2 

Two different types of sensor DNA vector construct were compared. Strains E. coli 
K-12/pTetLuxl and E. coli K-12/pTetLucl were cultivated in L-broth media until 
optical density measured at 600 ran (OD600) was 1.5. The cells were diluted 1 to 50 
5 with 25 mM MES-buffer in M9 minimal medium, pH 6.0 (Sambrook et aL, 1 989, 
Cold Spring Harbor Laboratory, Cold Spring Harbor) and 190 jil was added to 
microtitration plate wells and 10 nl of sample dilution of tetracycline was added. 
After a 60 min incubation at 37 °C the light emission was measured using a 
Labsystems Luminoskan luminometer. Figure 8 shows the bioluminescence dose 
10 response curve as a function of tetracycline added. As seen from the figure both 

systems (bacterial and insect luciferase) give roughly equal sensitivity of 

tetracycline detection. 

One is able to use different luciferases instead of bacterial luciferase (SEQ ID 
15 NO: 4-8) from P. luminescens without losing sensitivity or other performance of the 
test. Figure 8 shows an analogous measurement to the one in Figure 4b. In the 
plasmid used in this test (pTetLucl) the bacterial luciferase was compensated with 
firefly luciferase (SEQ ID NO: 2) as described in Figure 3. The test was done 
essentially as with bacterial luciferase except that after the cells had been incubated 
20 with or without tetracycline 10 minutes at 37 °C the cells were measured for light 
production after 15 minutes incubation time at 37 °C by adding 100 (4 of solution 
containing 1 mM D-luciferin, in 0.1 M Na-citrate buffer, pH 5.0. Thereafter light 
production was measured using a manual luminometer 1250 (LKB-Wallac, Turku, 
Finland). As can be seen from Figure 8 sensitivity of the method to detect 
25 tetracycline hydrochloride is extremely high and comparable to the detection made 
with bacterial luciferase. 
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Example 3 

A lipemic pig serum was spiked at different concentrations of tetracycline, 
chlorotetracycline and oxy tetracycline. Fresh E. coli K-12/pTetLuxl were diluted 
1:50 with 25 mM MES buffer in M9 minimal medium, pH 6.0. 100 \i\ bacterial 
5 suspension was added to microtiter plate wells containing 100 ul of pig serum 
spiked with different tetracyclines. The plate was incubated 90 minutes at 37 °C 
after which the plate was measured with Labsystems Luminoskan luminometer. As 
seen from Figure 9 the sensitivity of the assay of different tetracyclines in pig serum 
matrix is very high. 

10 

Example 4 

Tetracyclines will form chelate complexes with Ca 2+ and Mg 2+ in samples (e.g. 
milk), and loose their antimicrobial and induction activity in our assay system. 
Tetracyclines can be displaced from cation chelates by using strong chelating agents 

15 such as EDTA. Figure 10 shows the determination of tetracycline from a milk 
sample, which is spiked with different concentrations of tetracycline. Different 
amounts of EDTA were added to milk samples and this kind of displacement of 
cation-tetracycline complex clearly improved the sensitivity of the assay. In the 
assay we used freeze-dried E. coli K12/pTetLuxl that were reconstituted with 

20 L-broth 10 minutes in room temperature before the assay. 

Example 5 

Figure 1 1 shows the kinetics of bacterial bioluminescence after exposure of E. coli 
K-12/pTetLuxl to different dilutions of tetracycline antibiotics. The specific 
25 induction of tetracycline is very fast and specific light emission is seen already at the 
10 minutes measuring point in the assay. 
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It will be appreciated that the methods of the present invention can be incorporated 
in the form of a variety of embodiments, only a few of which are disclosed herein. It 
will be apparent for the specialist in the field that other embodiments exist and do 
not depart from the spirit of the invention. Thus, the described embodiments are 
illustrative and should not be construed as restrictive. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: KORPELA, Matti 

(B) STREET: Mai jamaentie 13 

(C) CITY: Naantali 

(E) COUNTRY: Finland 

(F) POSTAL CODE ( ZIP) : FIN-21100 

(A) NAME: KARP, Matti 

(B) STREET: Kampakatu 1 

(C) CITY: Kaarina 

(E) COUNTRY: Finland 

(F) POSTAL CODE (ZIP): FIN/20660 

(A) NAME: KURITTU, Jussi/ 

(B) STREET: Puutarhakatja 16 A 20 

(C) CITY: Turku 

(E) COUNTRY: Finl_ 

(F) POSTAL CODE /{Zip) : FIN-20100 

(ii) TITLE OF INVENTI^l^A^EW ASSAY METHOD 
(iii) NUMBER OF SEQUENCES i 

(iv) COMPUTER READi^LE FO^ 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERAT/NG SYSTEM: PC -DOS/MS-DOS 

(D) SOFTW/&E: Patentln Release #1.0, Version #1.30 (EPO) 

(Vi) PRIOR APPLICATION DATA: 

(A) AP/LICATION NUMBER: FI 974235 

(B) FHiING DATE: 14-NOV-1997 



PCT/FI98/00873 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4846 base pairs 
/(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D ) TOPOLOGY : c ircular 

(i/) MOLECULE TYPE: DNA (genomic) 
(i/i) HYPOTHETICAL: NO 
/(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Photinus pyralis 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: pTetLucl 

(viii) POSITION IN GENOME: 

(A) CHROMOSOME/ SEGMENT: Plasmid 
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H — j 



(ix) FEATURE: 

(A) NAME/ KEY : mxsc_feature 

(B) LOCATION :1. .3098 ..u^^r nAC5K75 n 

feature description below, SEQ ID 9-11. / 
/citation= ( [2] ) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

( (D) OTS? I ?^0^Ti0N!/product= "Photinus py/alis 
luciferase" 
/citation= ([ID 

(X) PUBLICATION INFORMATION: 

\i] pyralis lucifera/e: vectors that 

contain a modified luc coding sequence allowing 
convenient transfer into o*her systems 

(C) JOURNAL: Gene 

(D) VOLUME: 141 
(F) PAGES: 75-77 

IS R^IvaS'rESIDUES IN SECT ID NO: 1: FROM 3099 TO 4772 



etracycline promoter for the 
id production of a murine antxbody 
Lerichia coli 



(X) PUBLICATION INFORMATION: 

(A) AUTHORS: Stt ^ 

(B) TITLE: Use o 

tightlyVr 
fragment in 

(C) JOURNAL: Gene 

(D) VOLUME: 151 

(E) ISSUE: 1-2 

(F) PAGES: 131-^35 

IS! SS^'SiSS ™ ID NO: 1: FROM 1 TO 3098 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
AGCTTGACCT GTGAAGT^AA AAATGGCGCA CATTGTGCGA 
TACCGCTACT GCGTC^GGA TCTCCACGCG CCCTGTAGCG 
GTGGTGGTTA CGCGCAGCGT GACCGCTACA CTTGCCAGCG 
GCTTTCTTCC Ct4cTTTCT CGCCACGTTC GCCGGCTTTC 
GGGCTCCCTT /aGGGTTCCG ATTTAGTGCT TTACGGCACC 
TAGGGTGATG GTTCACGTAG TGGGCCATCG CCCTGATAGA 
TTGGAGTteA CGTTCTTTAA TAGTGGACTC TTGTTCCAAA 
ATCT9^GTCT ATTCTTTTGA TTTATAAGGG ATTTTGCCGA 
AA/GAGCTGA TTTAACAAAA ATTTAACGCG AATTTTAACA 
CAGGTGGCA CTTTTCGGGG AAATGTGCGC GGAACCCCTA 
'cATTCAAATA TGTATCCGCT CATGAGACAA TAACCCTGAT 
AAAAGGAAGA GTATGAGTAT TCAACATTTC CGTGTCGCCC 
TTTTGCCTTC CTGTTTTTGC TCACCCAGAA ACGCTGGTGA 
CAGTTGGGTG CACGAGTGGG TTACATCGAA CTGGATCTCA 



CATTTTTTTT 
GCGCATTAAG 
CCCTAGCGCC 
CCCGTCAAGC 
TCGACCCCAA 
CGGTTTTTCG 
CTGGAACAAC 
TTTCGGCCTA 
AAATATTAAC 
TTTGTTTATT 
AAATGCTTCA 
TTATTCCCTT 
AAGTAAAAGA 
ACAGCGGTAA 



GTCTGCCGTT 
CGCGGCGGGT 
CGCTCCTTTC 
TCTAAATCGG 
AAAACTTGAT 
CCCTTTGACG 
ACTCAACCCT 
TTGGTTAAAA 
GCTTACAATT 
TTTCTAAATA 
ATAATATTGA 
TTTTGCGGCA 
TGCTGAAGAT 
GATCCTTGAG 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 
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AGTTTTCGCC CCGAAGAACG TTTTCCAATG ATGAGCACTT TTAAAGTTCT GCTATGTGGC / 900 
GCGGTATTAT CCCGTATTGA CGCCGGGCAA GAGCAACTCG GTCGCCGCAT ACACTATTCt/ 960 

CAGAATGACT TGGTTGAGTA CTCACCAGTC ACAGAAAAGC ATCTTACGGA TGGCATGKCA 1020 

GTAAGAGAAT TATGCAGTGC TGCCATAACC ATGAGTGATA ACACTGCGGC CAACTTACTT 1080 

CTGACAACGA TCGGAGGACC GAAGGAGCTA ACCGCTTTTT TGCACAACAT GG^GGATCAT 1140 

GTAACTCGCC TTGATCGTTG GGAACCGGAG CTGAATGAAG CCATACCAAA/CGACGAGCGT 1200 

GACACCACGA TGCCTGTAGC AATGGCAACA ACGTTGCGCA AACTATTAXC TGGCGAACTA 1260 

CTTACTCTAG CTTCCCGGCA ACAATTGATA GACTGGATGG AGGCG^ATAA AGTTGCAGGA 1320 

CCACTTCTGC GCTCGGCCCT TCCGGCTGGC TGGTTTATTG CTG^TAAATC TGGAGCCGGT 1380 

GAGCGTGGCT CTCGCGGTAT CATTGCAGCA CTGGGGCCAG A^GGTAAGCC CTCCCGTATC 1440 

GTAGTTATCT ACACGACGGG GAGTCAGGCA ACTATGGATg/aACGAAATAG ACAGATCGCT 1500 

GAGATAGGTG CCTCACTGAT TAAGCATTGG TAGGAATTAA TGATGTCTCG TTTAGATAAA 1560 

AGTAAAGTGA TTAACAGCGC ATTAGAGCTG CTTAATGAGG TCGGAATCGA AGGTTTAACA 1620 

ACCCGTAAAC TCGCCCAGAA GCTAGGTGTJA GAGOAGCCTA CATTGTATTG GCATGTAAAA 1680 

AATAAGCGGG CTTTGCTCGA CGCCTTA^CN^^^GACATGT TAGATAGGCA CCATACTCAC 1740 

TTTTGCCCTT TAGAAGGGGA AAGCTGGCAA /GAT^TTTTAC GTAATAACGC TAAAAGTTTT 1800 

AGATGTGCTT TACTAAGTCA TCGCGATGGA GCAAAAGTAC ATTTAGGTAC ACGGCCTACA 1860 

GAAAAACAGT ATGAAACTCT CGAAAATCAA TTAGCCTTTT TATGCCAACA AGGTTTTTCA 1920 

CTAGAGAATG CATTATATGC ACTCAGCGCA GTGGGGCATT TTACTTTAGG TTGCGTATTG 1980 

GAAGATCAAG AGCATCAAGT CGOTAAAGAA GAAAGGGAAA CACCTACTAC TGATAGTATG 2040 

CCGCCATTAT TACGACAAGC TATCGAATTA TTTGATCACC AAGGTGCAGA GCCAGCCTTC 2100 

TTATTCGGCC TTGAATTGAl/ CATATGCGGA TTAGAAAAAC AACTTAAATG TGAAAGTGGG 2160 

TCTTAAAAGC AGCATAACCT TTTTCCGTGA TGGTAACTTC ACTAGTTTAA AAGGATCTAG 2220 

GTGAAGATCC TTTTTg/tAA TCTCATGACC AAAATCCCTT AACGTGAGTT TTCGTTCCAC 2280 

TGAGCGTCAG ACCCCKJTAGA AAAGATCAAA GGATCTTCTT GAGATCCTTT TTTTCTGCGC 2340 

GTAATCTGCT GCOTGCAAAC AAAAAAACCA CCGCTACCAG CGGTGGTTTG TTTGCCGGAT 2400 

CAAGAGCTAC CAACTCTTTT TCCGAAGGTA ACTGGCTTCA GCAGAGCGCA GATACCAAAT 2460 

actgtccttc/tagtgtagcc GTAGTTAGGC CACCACTTCA AGAACTCTGT AGCACCGCCT 2520 

ACATACCT^G CTCTGCTAAT CCTGTTACCA GTGGCTGCTG CCAGTGGCGA TAAGTCGTGT 2580 
CTTACC^GGT TGGACTCAAG ACGATAGTTA CCGGATAAGG CGCAGCGGTC GGGCTGAACG 2640 
GGGQOTTCGT GCACACAGCC CAGCTTGGAG CGAACGACCT ACACCGAACT GAGATACCTA 2700 
CAJSCGTGAGC TATGAGAAAG CGCCACGCTT CCCGAAGGGA GAAAGGCGGA CAGGTATCCG 2760 
GTAAGCGGCA GGGTCGGAAC AGGAGAGCGC ACGAGGGAGC TTCCAGGGGG AAACGCCTGG 2820 
JrATCTTTATA GTCCTGTCGG GTTTCGCCAC CTCTGACTTG AGC GTCGATT TTTGTGATGC 2880 
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TCGTCAGGGG GGCGGAGCCT ATGGAAAAAC GCCAGCAACG 

GCCTTTTGCT GGCCTTTTGC TCACATGACC CGACACCATC 

CCTAATTTTT GTTGACACTC TATCATTGAT AGAGTTATTT 

AGAGAAAAGT GAAATGAATA GTTCGACAAA AATCTAGAAC 

ATG GAA GAC GCC AAA AAC ATA AAG AAA GGC CCG 
Met Glu Asp Ala Lys Asn lie Lys Lys Gly Pro 
15 10 

CTA GAG GAT GGA ACC GCT GGA GAG CAA CTG CAT 
Leu Glu Asp Gly Thr Ala Gly Glu Gin Leu His 
20 25 

TAC GCC CTG GTT CCT GGA ACA ATT GCT TTT ACA 
Tyr Ala Leu Val Pro Gly Thr He Ala Phe Thr 
35 40 



CGGCCTTTTT ACGGTTCCTG 

GAATGGCCAG ATGATTAA^ 

TACCACTCCC TATCAG^GAT 

TAGTGGATCC CCOOTACC 

GCG CCA TTC /TAT CCG 
Ala Pro * Ph</ Tyr Pro 
15 

AAG GCT/ ATG AAG AGA 
Lys A/a. Met Lys Arg 
30 

GCA CAT ATC GAG 
Ala His He Glu 
45 



GTG AAC ATC ACG TAC GCG GAA TAC TTC GAA 
Val Asn He Thr Tyr Ala Glu Tyr Phe Gluy 
50 55 7 



GAA GCT ATG AAA CGA TAT GGG 
Glu Ala Met Lys Arg Tyr Gly 
65 70 

TGC AGT GAA AAC TCT CTT CAA TTC/ 
Cys Ser Glu Asn Ser Leu Gin Pfc 
85 



TG TCC GTT CGG TTG GCA 
Ser Val Arg Leu Ala 
60 

AAT CAC AGA ATC GTC GTA 
Asn His Arg He Val Val 
75 80 



CCG GTG TTG GGC GCG TTA 
Met Pro Val Leu Gly Ala Leu 
90 95 



TTT ATC GGA GTT GCA GTT GCC 
Phe He Gly Val Ala Val 
100 



'CCC GCG AAC GAC ATT TAT AAT GAA CGT 
Pro Ala Asn Asp He Tyr Asn Glu Arg 
105 HO 



GAA TTG CTC AAC AGT 
Glu Leu Leu Asn Ser 
115 

TCC AAA AAG GGG 
Ser Lys Lys Gly L^ 
130 



AWAAC ATT TCG CAG CCT ACC GTA GTG TTT GTT 
Asn He Ser Gin Pro Thr Val Val Phe Val 
120 125 

'CAA AAA ATT TTG AAC GTG CAA AAA AAA TTA CCA 
Gin Lys He Leu Asn Val Gin Lys Lys Leu Pro 
135 140 



ATA ATC CAG 
He He Gin 
145 



'AIT ATT ATC ATG GAT TCT AAA ACG GAT TAC CAG GGA 
He He He Met Asp Ser Lys Thr Asp Tyr Gin Gly 
150 155 160 



TTT CAG TCg/aTG TAC ACG TTC GTC ACA TCT CAT CTA CCT CCC GGT TTT 
Phe Gin Sef Met Tyr Thr Phe Val Thr Ser His Leu Pro Pro Gly Phe 
165 170 175 

AAT GAa/tAC GAT TTT GTA CCA GAG TCC TTT GAT CGT GAC AAA ACA ATT 
Asn Glid Tyr Asp Phe Val Pro Glu Ser Phe Asp Arg Asp Lys Thr He 
180 185 190 

ATA ATG AAC TCC TCT GGA TCT ACT GGG TTA CCT AAG GGT GTG 
Ala 7 Leu He Met Asn Ser Ser Gly Ser Thr Gly Leu Pro Lys Gly Val 
195 200 205 

a^CC CTT CCG CAT AGA ACT GCC TGC GTC AGA TTC TCG CAT GCC AGA GAT 
/Ala Leu Pro His Arg Thr Ala Cys Val Arg Phe Ser His Ala Arg Asp 
210 215 220 

CCT ATT TTT GGC AAT CAA ATC ATT CCG GAT ACT GCG ATT TTA AGT GTT 
Pro He Phe Gly Asn Gin He He Pro Asp Thr Ala He Leu Ser Val 
225 230 235 240 



2940 
3000 
3060 
3118 
3166 

3214 

3262 

3310 

3358 

3406 

3454 

3502 

3550 

3598 

3646 

3694 

3742 

3790 

3838 
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GTT CCA TTC CAT CAC GGT TTT GGA ATG TTT ACT ACA CTC GGA TAT TTG 
SS Pro Phe Sis Sis Gly Phe Gly Met Phe Thr Thr Leu Gly Tyr Leu 

245 250 

ATA TGT GGA TTT CGA GTC GTC TTA ATG TAT AGA TTT GAA GAA GAG CI 
He Cys Gly Phe Arg Val Val Leu Met Tyr Arg Phe Glu Glu Glu y 
260 265 2/0 ' 



TTT TTA CGA TCC CTT CAG GAT TAC AAA ATT CAA AGT GCG TTG 
Phe Leu Arg Ser Leu Gin Asp Tyr Lys He Gin Ser Ala Leu 
275 280 



GTA 
Val 



KC AAA TAC 
Lys Tyr 



CCA ACC CTA TTT TCA TTC TTC GCC AAA AGC ACT CTG ATT 
Pro Thr Leu Phe Ser Phe Phe Ala Lys Ser Thr Leu lie 
290 295 300 

GAT TTA TCT AAT TTA CAC GAA ATT GCT TCT GGG GGC /cA CCT CTT TCG 
Sp Su III J£n Leu His Glu lie Ala Ser Gly GlV^la Pro Leu Ser 
305 310 315 

AAA GAA GTC GGG GAA GCG GTT GCA AAA CGC TTc/cAT CTT CCA GGG ATA 
Lys Glu Val Gly Glu Ala Val Ala Lys Arg Phe" Hxs Leu Pro Gly He 
325 330 / 



CGA CAA GGA TAT GGG CTC ACT GAG 
Arg Gin Gly Tyr Gly Leu Thr Glu 
340 



'TCA GCT ATT CTG ATT ACA 
Ser Ala He Leu He Thr 
345/>^ 3 50 

'gTC/GGT AAA GTT GTT CCA TTT 
Gly Lys Val Val Pro Phe 
365 



CCC GAG GGG GAT GAT AAA CCG GGC 
Pro Glu Gly Asp Asp Lys Pro Gly Al 
355 3 60 

TTT GAA GCG AAG GTT GTG GAT CT^ GAT ACC GGG AAA ACG CTG GGC GTT 
Phe Glu Ala Lys Val Val Asp Jjfx Asp Thr Gly Lys Thr Leu Gly Val 
370 3 75 / 380 



AAT CAG AGA GGC GAA TTA TG? 
Asn Gin Arg Gly Glu Leu 
385 3 ^0 



GTC AGA GGA CCT ATG ATT ATG TCC GGT 
Val Arg Gly Pro Met He Met Ser Gly 
395 400 



TAT GTA AAC AAT CCG G2 
Tyr Val Asn Asn Pro 
405 



GCG ACC AAC GCC TTG ATT GAC AAG GAT GGA 
Ala Thr Asn Ala Leu He Asp Lys Asp Gly 
410 415 



TGG CTA CAT TCT 
Trp Leu His Ser G] 
420 



'QAC ATA GCT TAC TGG GAC GAA GAC GAA CAC TTC 
Asp He Ala Tyr Trp Asp Glu Asp Glu His Phe 
425 430 



TTC ATA GTT 
Phe He Val 
435 



'CGC TTG AAG TCT TTA ATT AAA TAC AAA GGA TAC CAG 
Arg Leu Lys Ser Leu He Lys Tyr Lys Gly Tyr Gin 
440 445 



GTG GCC CCC/bCT GAA TTG GAG TCG ATA TTG TTA CAA CAC CCC AAC ATC 
Val Ala Pre/ Ala Glu Leu Glu Ser He Leu Leu Gin His Pro Asn He 
450 / 455 460 



TTC GAC 
Phe Asp/ 
465 



-G GGC GTG GCA GGT CTT CCC GAC GAT GAC GCC GGT GAA CTT 
Gly Val Ala Gly Leu Pro Asp Asp Asp Ala Gly Glu Leu 
470 475 480 



CCC G<£C GCC GTT GTT GTT TTG GAG CAC GGA AAG ACG ATG ACG GAA AAA 
Pro Ala Val Val Val Leu Glu His Gly Lys Thr Met Thr Glu Lys 

485 490 495 

GAg/ ATC GTG GAT TAC GTC GCC AGT CAA GTA ACA ACC GCC AAA AAG TTG 
GWi He Val Asp Tyr Val Ala Ser Gin Val Thr Thr Ala Lys Lys Leu 
500 505 510 



3886 



3934 



3982 



4030 



4078 



4126 



4174 



4222 



4270 



4318 



4366 



4414 



4462 



4510 



4558 



4606 



4654 
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515 520 

530 535 
GGC GGA AAG TCC AAA TTG TAAAATGTAA CTGTATTCAG CGATGACGAA 
Gly Gly Lys Ser Lys Leu 
545 550 

ATTCTTAGCT ATTGTAATAC TCTAGCGGGC TGCAGGAATT CGATATCA 



4702 



4750 



4798 



4846 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 550 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 

Met Glu Asp Ala Lys Asn lie Lys Lys Gly fo Ala Pro Phe Tyr Pro 



Leu Glu Asp Gly Thr Ala Gly Gli 
20 ' 

Tyr Ala Leu Val Pro Gly Thr lie 
35 40 

Val Asn He Thr Tyr Ala Glu Tyi 
50 55 



25 



His Lys Ala Met Lys Arg 
30 

Thr Asp Ala His He Glu 
45 

vGlu Met Ser Val Arg Leu Ala 
60 



Glu Ala Met Lys Arg Tyr Gly/eu Asn Thr Asn His Arg He Val Val 

65 70 / 

Cys Ser Glu Asn Ser Leu oln Phe Phe Met Pro Val Leu Gly Ala Leu 

Phe He Gly Val Ala v/*la Pro Ala Asn Asp He Tyr Asn Glu Arg 

100 / 10b 

Glu Leu Leu Asn Se/ M et Asn lie Ser Gin Pro Thr Val Val Phe Val 

115 / 120 

Ser Lys Lys Gly/eu Gin Lys Xle Leu Asn Val Gin Lys Lys Leu Pro 

lie lie Gin /s He He He Met Asp Ser Lys Thr Asp Tyr Gin Gly 
145 / 150 

Phe Gin s4 Met Tyr Thr Phe Val Thr Ser His Leu Pro Pro Gly Phe 

Y 165 170 

Asn g/ Tyr Asp Phe Val Pro Glu Ser Phe Asp Arg Asp Lys Thr lie 

A l/ L eu He Met Asn Ser Ser Gly Ser Thr Gly Leu Pro Lys Gly Val 

195 200 
/4a Leu Pro His Arg Thr Ala Cys Val Arg Phe Ser His Ala Arg Asp 
210 215 220 
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Pro lie Phe Gly Asn Gin lie He Pro Asp Thr Ala He Leu SerWi 
225 230 / 

Val Pro Phe His His Gly Phe Gly Met Phe Thr Thr Leu Gly T/r Leu 

245 ZDU / 

He Cys Gly Phe Arg Val Val Leu Met Tyr Arg Phe Glu G^ Glu Leu 

260 26b / 

Phe Leu Arg Ser Leu Gin Asp Tyr Lys He Gin Ser A^ Leu Leu Val 

275 -6oU y 

Pro Thr Leu Phe Ser Phe Phe Ala Lys Ser Thr Le/lle As P Lys Tyr 

ASP Leu Ser Asn Leu His Glu lie Ala Ser 6j/ 31y H* Pro Leu Ser 

305 310 / 

Lys Glu val Gly Glu Ala Val Ala Lys Ar^ Phe His Leu Pro Gly He 



Arg Gin Gly Tyr Gly Leu Thr 
340 y 

Pro Glu Gly Asp Asp Lys Pi 
355 

Phe Glu Ala Lys Val Val Asp 
370 375 >* 



'Thr Ser Ala He Leu He Thr 
350 



Gly Lys Val Val Pro Phe 
365 



^Thr Gly Lys Thr Leu Gly Val 
380 

~_/ tv\« piv Pro Met He Met Ser Gly 
Asn Gin Arg Gly Glu Leu c/s Val Arg Gly Pro Mec ^ 

385 390 / 

^r val Asn Asn Pro Gl/ Ala Thr Asn Ala Leu He Asp Lys Asp Gly 

y 405 / 410 

Trp Leu His Ser Gly/sp He Ala Tyr Trp Asp Glu Asp Glu His Phe 

420 / 42b 
Phe He Val Asp dg Leu Lys Ser Leu He Lys Tyr Lys Gly Tyr Gin 
435 / 440 



Val Ala Pro 
450 



Glu Leu Glu Ser He Leu Leu Gin His Pro Asn He 
455 460 



Phe Asp Ala/ly Val Ala Gly Leu Pro Asp Asp Asp Ala Gly Glu Leu 

465 / 470 

Pro Ala A/a Val Val Val Leu Glu His Gly Lys Thr Met Thr Glu Lys 

/ 485 

Glu Il/val Asp Tyr Val Ala Ser Gin Val Thr Thr Ala g. Lys Leu 

7 500 50b 

Arg L Gly Val Val Phe Val Asp Glu Val Pro Lys Gly Leu Thr Gly 

/ 515 520 

L eu Asp Ala Arg Lys lie Arg Glu He Leu He Lys Ala Lys Lys 

530 535 

3ly Gly Lys Ser Lys Leu 
/545 550 
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(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10220 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D ) TOPOLOGY : circular 

<ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL : NO 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Photorhabdus J^uminescens 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: pTetLuxl 



(ix) FEATURE: 

(A) NAME /KEY : mi 

(B) LOCATION: jo/nU, 
(D) OTHER INFQ 

/note= 
f eatur^, de 
/citatios^ *M2] ; 



ature 

3190, 10140. .10220) 

3 1 andar d__name = "vector pASK75" 
\plasmid originating from vector pASK75; 
Jon below, SEQ ID NO: 9-11. n 



(ix) FEATURE: 
(A) 
(B) 
(D) OTHER 



NAME /KEY 
LOCATIOJ 



CDS 
3634. .5082 
'ORMATION: /product = 
;ation= ([1]) 



"Lux C" 



(ix) FEATURE 
(A) N 



S=Stt: 

Q 
£3 



/KEY: CDS 
(B) LOCATION: 5097. .6017 

(D) o/HER INFORMATION: /product= "Lux D" 
/citation= {[1]) 



(ix) 



NAME/ KEY: CDS 
i) LOCATION: 6069. .7148 

)) OTHER INFORMATION: /product = "Lux A" 
/citation= ([1]) 

(ix) /FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 7166. .8146 

(D) OTHER INFORMATION: /product = "Lux B" 
/citation= ([1]) 

(ix) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 8256. .9437 

(D) OTHER INFORMATION: /product = "Lux E" 
/citation= ([1]) 

(x) PUBLICATION INFORMATION: 

(A) AUTHORS: Frackman, 

(B) TITLE: Cloning, organization and expression of the 

bio luminescence genes of Xenorhabdus 
lumiminescens s 

(C) JOURNAL: J. Bacterid. 

(D) VOLUME: 172 

(F) PAGES: 5767-5773 

(G) DATE: 1990 

(K) RELEVANT RESIDUES IN SEQ ID NO: 3: FROM 3191 TO 10139 
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(x) PUBLICATION INFORMATION: / 

(A) AUTHORS: Skerra, A / 

(B) TITLE: Use of the tetracycline promoter tor tae 

tightly regulated production of a murine antibody 
fragment in Escherichia coli / 

(C) JOURNAL: Gene / 

(D) VOLUME: 151 / 

(E) ISSUE: 1-2 / 

(F) PAGES: 131-135 / 

(G) DATE* 30 -DEC— 1994 / 

(K) RELEVANT RESIDUES IN SEQ ID NO: 3: F/OM 1 TO 3190 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: / 

AGCTTGACCT GTGAAGTGAA AAATGGCGCA CATTGTGCGA /tTTTTTTT GTCTGCCGTT 
TACCGCTACT GCGTCACGGA TCTCCACGCG C CCTGTAGGG GCGCATTAAG CGCGGCGGGT 
gtggtggtta cgcgcagcgt gaccgctaca cttgcc^ecg CCCTAGCGCC CGCTCCTTTC 

GCTTTCTTCC CTTCCTTTCT CGCCACGTTC GCCG^TTTC CCCGTCAAGC TCTAAATCGG 
GGGCTCCCTT TAGGGTTCCG ATTTAGTGCT TyfcGGCACC TCGACCCCAA AAAACTTGAT 
TAGGGTGATG GTTCACGTAG TGGGCCATCG GCCTGATAGA CGGTTTTTCG CCCTTTGACG 
TTGGAGTCCA CGTTCTTTAA TAGTGGACTc/ TTGTTCCAAA CTGGAACAAC ACTCAACCCT 
ATCTCGGTCT ATTCTTTTGA TTTATAaIgG ATTTTGCCGA TTTCGGCCTA TTGGTTAAAA 
AATGAGCTGA TTTAACAAAA ATTTA^Gj AATTTTAACA AAATATTAAC GCTTACAATT 
TCAGGTGGCA CTTTTCGGGG AAAT^TG^G^C GGAACCCCTA TTTGTTTATT TTTCTAAATA 
CATTCAAATA TGTATCCGCT CAT&AGACAA TAACCCTGAT AAATGCTTCA ATAATATTGA 
AAAAGGAAGA GTATGAGTAT JCAACATTTC CGTGTCGCCC TTATTCCCTT TTTTGCGGCA 
TTTTGCCTTC CTGTTTTTGC/tCACCCAGAA ACGCTGGTGA AAGTAAAAGA TGCTGAAGAT 
CAGTTGGGTG CACGAGTOfe TTACATCGAA CTGGATCTCA ACAGCGGTAA GATCCTTGAG 
AGTTTTCGCC CCGAAGAACG TTTTCCAATG ATGAGCACTT TTAAAGTTCT GCTATGTGGC 
GCGGTATTAT CCCGTATTGA CGCCGGGCAA GAGCAACTCG GTCGCCGCAT ACACTATTCT 
CAGAATGACT TGGJK"GAGTA CTCACCAGTC ACAGAAAAGC ATCTTACGGA TGGCATGACA 
GTAAGAGAAT T^TGCAGTGC TGCCATAACC ATGAGTGATA ACACTGCGGC CAACTTACTT 
CTGACAACGA /cGGAGGACC GAAGGAGCTA ACCGCTTTTT TGCACAACAT GGGGGATCAT 
GTAACTCGc/ TTGATCGTTG GGAACCGGAG CTGAATGAAG CCATACCAAA CGACGAGCGT 
GACACCAOGA TGCCTGTAGC AATGGCAACA ACGTTGCGCA AACTATTAAC TGGCGAACTA 
cttacWtag CTTCCCGGCA ACAATTGATA GACTGGATGG AGGCGGATAA AGTTGCAGGA 
CCACTfcTGC GCTCGGCCCT TCCGGCTGGC TGGTTTATTG CTGATAAATC TGGAGCCGGT 
GAGCGTGGCT CTCGCGGTAT CATTGCAGCA CTGGGGCCAG ATGGTAAGCC CTCCCGTATC 
GTAGTTATCT ACACGACGGG GAGTCAGGCA ACTATGGATG AACGAAATAG ACAGATCGCT 
ga4ataggtg CCTCACTGAT TAAGCATTGG TAGGAATTAA TGATGTCTCG TTTAGATAAA 
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AGTAAAGTGA TTAACAGCGC ATTAGAGCTG CTTAATGAGG TCGGAATCGA AGS 
ACCCGTAAAC TCGCCCAGAA GCTAGGTGTA GAGCAGCCTA CATTGTATTG 
AATAAGCGGG CTTTGCTCGA CGCCTTAGCC ATTGAGATGT TAGATAcWcCATACTCAC 
TTTTGCCCTT TAGAAGGGGA AAGCTGGCAA GATTTTTTAC GTAATa/gC TAAAAGTTTT 
AGATGTGCTT TACTAAGTCA TCGCGATGGA GCAAAAGTAC ATTT^GTAC ACGGCCTACA 
GAAAAACAGT ATGAAACTCT CGAAAATCAA TTAGCCTTTT Ta/gCCAACA AGGTTTTTCA 
CTAGAGAATG CATTATATGC ACTCAGCGCA GTGGGGCATT/TTACTTTAGG TTGCGTATTG 
GAAGATCAAG AGCATCAAGT CGCTAAAGAA GAAAGGGAaI CACCTACTAC TGATAGTATG 
CCGCCATTAT TACGACAAGC TATCGAATTA TTTGA^ACC AAGGTGCAGA GCCAGCCTTC 
TTATTCGGCC TTGAATTGAT CATATGCGGA TTAoUc AACTTAAATG TGAAAGTGGG 
TCTTAAAAGC AGCATAACCT TTTTCCGTGA TOGTAACTTC ACTAGTTTAA AAGGATCTAG 
GTGAAGATCC TTTTTGATAA TCTCATGACCy'AAAATCCCTT AACGTGAGTT TTCGTTCCAC 
TGAGCGTCAG ACCCCGTAGA AAA^A^ GGATCTTCTT GAGATCCTTT TTTTCTGCGC 
GTAATCTGCT GCTTGCAAAC AaW» CCGCTACCAG CGGTGGTTTG TTTGCCGGAT 
CAAGAGCTAC CAACTCTTTT 4cdU ACTGGCTTCA GCAGAGCGCA GATACCAAAT 
ACTGTCCTTC TAGTGTAGCC GTAGTTAGGC CACCACTTCA AGAACTCTGT AGCACCGCCT 
ACATACCTCG CTCTGCTAAT AgTTACCA GXGGCTGCTG CCAGTGGCGA TAAGTCGTGT 
CTTACCGGGT TGGACTCAa/ ACGATAGTTA CCGGATAAGG CGCAGCGGTC GGGCTGAACG 
GGGGGTTCGT GCACACa/c CAGCTIGGAG CGAACGACCT ACACCGAACT GAGATACCTA 
CAGCGTGAGC TATGAGAAAG CGCCACGCTT CCCGAAGGGA GAAAGGCGGA CAGGTATCCG 
GTAAGCGGCA GGG^GGAAC AGGAGAGCGC ACGAGGGAGC TTCCAGGGGG AAACGCCTCG 
TATCTTTATA G^TCGG GTTTCGCCAC CTCTGACTTG AGCGTCGATT TTTGTGATGC 
TCGTCAGGGG OGCGGAGCCT ATGGAAAAAC GCCAGCAACG CGGCCTTTTT ACGGTTCCTG 
GCCTTTTGCyeGCCTTTTGC TCACATGACC CGACACCATC GAATGGCCAG ATGATTAATT 
cctaattt/t GTTGACACTC TATCATTGAT AGAGTTATTT taccactccc TATCAGTGAT 
agagaaa/gt GAAATGAATA GTTCGACAAA AATCTAGATA ACGAGGGCAA aaaatgaaaa 
7 TAT CGCGATTGCA GTGGCACTGG CTGGTTTCGC TACCGTAGCG CAGGCCTGAG 
ACCAGAATTC TTCTTTAGAA ATCTGCCGGT AAAAATTAGA TTGCTATTCA ATCTATTTCT 
ATC^TATTT GTGAAATAAT ACTCAGGATA ATAATTTACA TAAATATTAT CACGCATTAG 
AGMGAGCAT GACTTTTTTA ATTTAAACTT TTCATTAACA AATCTTGTTG ATATGAAAAT 
rCCTTTGC TATTTTAACA GATATTAAAA CGGGAATAGG CGTTATATTG ACGATCCATT 
^AGTTAGATT AAAAACCTTG AGCAGAAAAT TTATATTATT ATCATAATTA TGACGAAAGT 
TACAGGCCAG GAACCACGTA GTCAGAATCT GATTTTCTAT ATATTTGTTA TTTACATCGT 
CATAACACAA AAATATAAGA AGCAAGTGTT GGTACGACCA GTTCGCAAGA TAGTTAAACA 
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GCAACTTAAG TTGAAATTAC CCCCATTAAA TGG ATG GCA £T Met JS ^ 

555 



ATT TCA TTC ATT ATT AAC GGC CAG GTT 
He Ser Phe He He Asn Gly Gin Val 
560 565 

GAT TTA GTG CAA TCC ATT AAT TTT GGT 
Asp Leu Val Gin Ser He Asn Phe Gly 
575 580 

ATA TTG AAT GAC TCT CAT GTA AAA AAC 
He Leu Asn Asp Ser His Val Lys Asn 
590 595 

AAC GAA TTA CGG TTG CAT AAC ATT GTC 
Asn Glu Leu Arg Leu His Asn He Val 
610 

CAA AGA TGG AAA AAT GAA GAA TAC TCA 
Gin Arg Trp Lys Asn Glu Glu Tyr Ser 
625 630 

GAC TTA AAA AAA TAT ATG GGA TAT 
Asp Leu Lys Lys Tyr Met Gly Tyr S^ 
640 

GCC AAT TGG ATA TCT ATG A3 
Ala Asn Trp He Ser Met XI 
655 66 ^ 



GAA ATC TTT CCC GAA Aflt GAT 
Glu He Phe Pro Glu 0bx Asp 
570 

GAT AAT AGT GTT /kQ CTG CCA 
Asp Asn Ser Val/Tyr Leu Pro 
585 

ATT ATT GAT yTGT AAT GGA AAT 
He He Asj/ Cys Asn Gly Asn 
600 / 605 

AAT TTT/CTC TAT ACG GTA GGG 
Asn Ph£ Leu Tyr Thr Val Gly 
615 / 620 

^CGC AGG ACA TAC ATT CGT 
Arg Arg Thr Tyr He Arg 
635 

'GAA GAA ATG GCT AAG CTA GAG 
Glu Glu Met Ala Lys Leu Glu 
650 

TCT AAA GGC GGC CTT TAT GAT 
Ser Lys Gly Gly Leu Tyr Asp 
665 



\CGC 



CAT ATC ATG GAT GAA TGG CTA 
His He Met Asp Glu Trp Leu 
680 685 

TTT CCG AAA GGT AAA TCT GTA 
Phe Pro Lys Gly Lys Ser Val 
695 700 



GTT GTA GAA AAT GAA CTT GGy TCT 
Val Val Glu Asn Glu Leu Gfi/ Ser 
670 675 

CCT CAG GAT GAA AGT TAt/ GTT CGG GCT 
Pro Gin Asp Glu Ser Ty/ Val Arg Ala 
690 7 

SS2S SS S 2 S S S K 1 S S 

705 / 710 

720 / 725 

L C n T gca TTA GCG TTA AGT TTT ATT GAT GTA GAC 

tC SI ten fli Su Ala Leu Ser Phe He Asp Val Asp 
740 745 



GAT CCT TTT 
Asp Pro Phe 

ssssasssgsssssssss 

750 / 755 

C» OOt/oXT A=A « CTC « £ M MT ATG CGA CAT «J f» «J 

Gin Glf Asp Thr Ser Leu Ala Lys Glu lie wet mry 
' 770 775 

ATT *TC GCT TGG GGA GGG CCA GAT GCG ATT AAT TGG GCG GTA GAG CAT 
He Aral Ala Trp Gly Gly Pro Asp Ala He Asn Trp A±a va^ 
785 790 
^ Prft ttt tat GCT GAT GTG ATT AAA TTT GGT TCT AAA AAG AGT CTT 
S fro Sr £a Asp val He Lys Phe Gly Ser Lys Lys Ser Leu 
800 805 
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GGT GCG/ 
Gly Ala 



SS £ £ SS i£ S SS SS S S E £ £2 S 

815 820 
GCT CAT GAT GTT TGT TTT TAG GAT CAG CGA GCT TGT TTT TCT 
Sa Ms Asp val Cys Phe Tyr Asp Gin Arg Ala Cys Phe Ser 

830 835 

AAC ATA TAT TAG ATG GGA AAT CAT TAT GAG GAA TTT AAG TH 
Asn He Tyr Tyr Met Gly Asn His Tyr Glu Glu Phe Lys La 
850 

ATA GAA AAA CTT AAT CTA TAT GCG CAT ATA TTA CCG GCC : AAA AAA 

lie Glu Lys Leu Asn Leu Tyr Ala His He Leu fro 
865 870 



GCC/ CAA 
Ala Gin 
845 

GCG TTG 
Ala Leu 
860 



GAT TTT GAT GAA AAG GCG GCC TAT 
Asp Phe Asp Glu Lys Ala Ala Tyr 
880 885 

TTT GCT GGA TTA AAA GTA GAG GTG 
Phe Ala Gly Leu Lys Val Glu Val 
895 900 

ATT GAG TCA AAT GCA GGT GTG GAA 
He Glu Ser Asn Ala Gly Val Glu 
910 9" 

GTG TAC CTT CAT CAC GTC GAT AP 
Val Tyr Leu His His Val Asp As 
930 



TCT TTA GTT CAX AAA GAA AGC TTG 
Ser Leu Val G/n Lys Glu Ser Leu 
890 

GAT ATT CA? CAA CGT TGG ATG ATT 
Asp He H/s Gin Arg Trp Met He 
7 905 

TTT AAT? CAA CCA CTT GGC AGA TGT 
Phe Ash Gin Pro Leu Gly Arg Cys 
920 925 

'GAG CAA ATA TTG CCT TAT GTT 
Gin He Leu Pro Tyr Val 
35 940 



CAA AAA AAT AAG ACG CAA ACC ATA 
Gin Lys Asn Lys Thr Gin Thr lief Se. 

945 / 950 

TTT AAA TAT CGA GAT GCG TTA <SCA TTA 
Phe Lys Tyr Arg Asp Ala Leu/Ala Leu 
960 / 965 

GAA GCA GGA ATG AAT AAC a/a TTT CGA 
Glu Ala Gly Met Asn Asn He Phe Arg 
975 y980 

ATG AGA CCG TTG CAA CG* TTA GTG ACA 
Met Arg Pro Leu Gin A£g Leu Val Thr 
990 995 

TCT AAC TAT ACG GCt/aAG GAT GTT GCG 
Ser Asn Tyr Thr Ala 7 Lys Asp Val Ala 
lOAO 



j(TT TTT CCT TGG GAG TCA TCA 
He Phe Pro Trp Glu Ser Ser 
955 

AAA GGT GCG GAA AGG ATT GTA 
Lys Gly Ala Glu Arg He Val 
970 

GTT GGT GGA TCT CAT GAC GGA 
Val Gly Gly Ser His Asp Gly 
985 

TAT ATT TCT CAT GAA AGG CCA 
Tyr He Ser His Glu Arg Pro 
1000 1005 

GTT GAA ATA GAA CAG ACT CGA 
Val Glu He Glu Gin Thr Arg 
1015 1° 20 



kT AAG TTC CTT GTA TTT GTC CCA TAATAGGTAA 
so Lys Phe Leu Val Phe Val Pro 
y 1030 

( T GAA TCA AAA TAT AAA ACC ATC GAC CAC GTT ATT TGT 
si ^ Ser Ss Tyr Lys Thr lie Asp Hxs Val He Cys 
5 10 



TTC CTG GAA GAA 
Phe Leu Glu Glu 
102/ 

AAGT ATG GAA 
Met Glu 

GTT GAA GGA/AAT AAA AAA ATT CAT GTT TGG GAA ACG CTG CCA GAA GAA 

Val Glu Gly /Asn Lys Lys He His Val Trp biu 
' 20 25 

i ~ *™ i\ a i*"* a at arc ATT ATT ATT GCG TCT GGT TTT GCC 
AAC AGC CCA AAG AGA AAG AAT GCC ATT ATT Al ^ ^ 

Asn Ser Pr/o Lys Arg Lys Asn Ala lie lie lie aj. * 
35 40 
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y3 



w Aaa ATG gat CAT TTT GCT GGT CTG GCG GAA TAT TTA TCG CGG 

22 2S ^ Phe Ala Gly Leu Ala Glu Tyr Leu Ser Arg to 
50 55 
^* ttt rAT pTG ATC CGC TAT GAT TCG CTT CAC CAC GTT GGA T^G AGT 
SS J2 SK SS fll S Tyr Asp Ser Leu His His Val Gly Zeu Ser 
65 70 7 5 / 

mpjv ACA ATT GAT GAA TTT ACA ATG TCT ATA GGA AAG Q&G AGC TTG 

££ SS i£ Glu Phe Thr Met Ser lie Gly Lys/Gln Ser Leu 

80 85 

115 120 7 

130 I 35 7 14U 



TTA GGG/TTT GAT TAT CTC AGT CTA CCC 
Phe Asp Tyr Leu Ser Leu Pro 
155 




/GAT 
,Ast 



GAA GGC CAT AAA TTG GGT 
She Glu Gly His Lys Leu Gly 
170 175 



AGA TAT TCT CTT GAA AGA GCT 
Arg Tyr Ser Leu Glu Arg Ala 
145 150 

ATT AAT GAA TTG CCG GAT AA' 
He Asn Glu Leu Pro Asp As 
160 165 

GCT GAA GTC TTT GCG AGA 
Ala Glu Val Phe Ala Arg Ast 
180 



/TGT CTT GAT TTT GGT TGG GAA GAT TTA 
'cys Leu Asp Phe Gly Trp Glu Asp Leu 
185 190 



TTT ACT GCA AAT AAC 
Phe Thr Ala Asn Asn y 
210 



*AT AAT TGG GTC AAG CAA GAT GAA GTT ATC ACA 
% % S Trp Val Lys Gin Asp Glu Val He Thr 
215 220 



TTG TTA TCA AAT 
Leu Leu Ser Asn 
225 

GGA AGT TCG CA? 
Gly Ser Ser 
240 



TAT CAA TCG 
Tyr Gin Ser, 



jTT 
r Val 



rr CGT AGT AAT CGA TGC AAG ATA TAT TCT TTG TTA 
S Arg Ser Asn Arg Cys Lys lie Tyr Ser Leu Leu 
230 235 

'qAC TTG AGT GAA AAT TTA GTG GTC CTG CGC AAT TTT 
Jsp Leu Ser Glu Asn Leu Val Val Leu Arg Asn Phe 
245 2 ^0 

ACG AAA GCC GCT ATC GCG ATG GAT AAT GAT CAT CTG 
Thr Lys Ala Ala He Ala Met Asp Asn Asp His Leu 
260 265 



SSSS5SSSSSSSSJ2KSSSS 

T 275 280 



50 2 95 



ATT TC7 
He Si 
3( 



CTG TCT TAAAATCTAT TGAGATATTC TATCACTCAA ATAGCAATAT 
Leu Ser 



5285 



5333 



5381 
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AAGGACTCTC T ATG AAA TTT GGA AAC TTT TTG CTT ACA TAC CAA CCT CC 
AAGGACTCTQ- ^ ^ Qly ^ phe Leu Thr ^ Qln p ro p^O 

1 5 1° 



CGC 
£ly Arg 



CAA TTT TCT CAA ACA GAG GTA ATG AAA CGT TTG GTT AAA TTA 
3n Ser Gin Thr Glu Val Met Lys Arg Leu Val Lys Leu 
15 20 25 

ATC TCT GAG GAG TGT GGT TTT GAT ACC GTA TGG TTA CTG^AG CAT CAT 
lie Ser Glu Glu Cys Gly Phe Asp Thr Val Trp Leu W.Glu His His 
30 35 40 / 

TTC ACG GAG TTT GGT TTG CTT GGT AAC CCT TAT GCT GCT GCA TAT 

Phe Thr Glu Phe Gly Leu Leu Gly Asn Pro Tyr Val Ala Ala Ala Tyr 
50 55 x ou 



TTA CTT GGC GCG ACT AAA AAA TTG AAT GTA 
Leu Leu Gly Ala Thr Lys Lys Leu Asn Val 
65 7 0 



!a ACT GCC GCT ATT GTT 
Sly Thr Ala Ala lie Val 
75 



CTT CCC ACA GCC CAT CCA GTA CGC CAA 
Leu Pro Thr Ala His Pro Val Arg Gin 
80 35 



f _ GAA GAT GTG AAT TTA TTG 
Leu Glu Asp Val Asn Leu Leu 
90 



GAT CAA ATG TCA AAA GGA CGA TTT 
Asp Gin Met Ser Lys Gly Arg Phe 
95 100 



TAC AAC AAG GAC TTT CGC 
Tyr Asn Lys Asp Phe Arg 
110 H5 

GCC TTA GCG GAA TGC TGG 
Ala Leu Ala Glu Cys Trp 
130 

GGA TAT ATG GAA GCT 
Gly Tyr Met Glu Ala As 
145 



SG TTT GGT ATT TGC CGA GGG CTT 
rg Phe Gly He Cys Arg Gly Leu 
105 

ACA GAT ATG AAT AAC AGT CGC 
Thr Asp Met Asn Asn Ser Arg 
120 125 

v***^tG ATA AAG AAT GGC ATG ACA GAG 
Gly Leu He Lys Asn Gly Met Thr Glu 
135 I 40 

'AAT GAA CAT ATC AAG TTC CAT AAG GTA AAA 
Asn Glu His He Lys Phe His Lys Val Lys 
150 155 



GTA AAC CCC GCG GCG 
Val Asn Pro Ala hlj 
160 



/TAT AGC AGA GGT GGC GCA CCG GTT TAT GTG GTG 
Tyr Ser Arg Gly Gly Ala Pro Val Tyr Val Val 
165 I 70 



GCT GAA TCA 
Ala Glu Ser 
175 

ATG ATA TTA 
Met He Leu 
190 

GAG CTT TA? 
Glu Leu 



GCT tfCG ACG ACT GAG TGG GCT GCT CAA TTT GGC CTA CCG 
Sa/Ser Thr Thr Glu Trp Ala Ala Gin Phe Gly Leu Pro 
180 I 85 

J T TGG ATT ATA AAT ACT AAC GAA AAG AAA GCA CAA CTT 
2er Trp He He Asn Thr Asn Glu Lys Lys Ala Gin Leu 
* 195 200 205 

AAT GAA GTG GCT CAA GAA TAT GGG CAC GAT ATT CAT AAT 
J£ Sal Ala Gin Glu Tyr Gly His Asp He His Asn 
210 215 220 

ATC GAC /at TGC TTA TCA TAT ATA ACA TCT GTA GAT CAT GAC TCA ATT 
ill a!p fit! £5 Su Ser Tyr He Thr Ser Val Asp Kxb Asp Ser lie 
225 230 

AAA GCd AAA GAG ATT TGC CGG AAA TTT CTG GGG CAT TGG TAT GAT TCT 
Lys All Lys Glu He Cys Arg Lys Phe Leu Gly His Trp Tyr Asp Ser 
Y P 240 245 250 

TAT g/g AAT GCT ACG ACT ATT TTT GAT GAT TCA GAC CAA ACA AGA GGT 
Tyr ^1 Asn Ala Thr Thr He Phe Asp Asp Ser Asp Gin Thr Arg Gly 
/ 55 260 265 
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TAT GAT TTC AAT AAA GGG CAG TGG CGT GAC TTT GTA TTA AAA GGA g 
Tyr Asp Phe Asn Lys Gly Gin Trp Arg Asp Phe Val Leu Lys uxy w 
270 275 280 

^ GAT ACT AAT CGC CGT ATT GAT TAC AGT TAC GAA ATC AAT CCC/GTG 
Lys Asp Thr Asn Arg Arg lie Asp Tyr Ser ryr ^ 

290 23 5 . 

GGA ACG CCG CAG GAA TGT ATT GAC ATA ATT CAA AAA GAC A^ GAT GCT 
Gly Thr Pro Gin Glu Cys He Asp lie He Gin Lys Asp 
305 310 ' 

s s s s s s s: i £ - Si S £ - 

- s; s s s s s 5 s s ss ss s s 

335 340 



TTT CTT AAA GAA AAA CAA CGT TCG CTA 
Phe Leu Lys Glu Lys Gin Arg Ser Leu L€ 
350 355 



/ TAT TAGCTAAGGA GAAAGAA 

Tyr 
360 



ATG AAA TTT GGA TTG TTC TTC CTT 
Met Lys Phe Gly Leu Phe Phe^ 
1 5 

CAA GAA CAA AGT ATA GTT tfGC ATq 
Gin Glu Gin Ser He Val 
20 

AAG TTG AAT TTT GAA CAG ATT, 
Lys Leu Asn Phe Glu Gin lie 
35 



TTA 
Leu 
40 



AAC/tTC ATC AAT TCA ACA ACT GTT 
Phe He Asn Ser Thr Thr Val 
10 15 

ATA ACG GAG TAT GTT GAT 
He Thr Glu Tyr Val Asp 
30 

AaT GAA AAT CAT TTT TCA GAT 
Tvr Glu Asn His Phe Ser Asp 
45 



AAT GGT GTT GTC GGC GCT 
Asn Gly Val Val Gly Ala, 
50 



•CT CTG ACT GTT TCT GGT TTT CTG CTC GGT 
'Pro Leu Thr Val Ser Gly Phe Leu Leu Gly 
55 60 



TTA ACA GAG AAA ATT 
Leu Thr Glu Lys He 
65 



ATT GGT TCA TTA AAT CAC ATC ATT ACA ACT 
He Gly Ser Leu Asn His He He Thr Thr 
70 75 80 



CAT CAT CCT GTC GCtf ATA GCG GAG GAA GCT TGC TTA TTG GAT CAG TTA 
S£ Ss Pro vll a| He Ala Glu Glu Ala Cys Leu Leu Asp Gin Leu 

_ Tvrrwp TTA GGG AGT GAT TGC GAA AAA AAA GAT 

S SS S? Sg/S & 2 S S ser Asp cys Glu g. l-ys asp 

109 10:3 

. nnn rrr rTT qpj, TAT CAA CAG CAA CTA TTT 

SS E 35 32 £ » S S ?S Si ?yr <*„ s ^ - ». 

115 / 120 

145 / I 50 155 

/ „™ r*nn r-nn rfT rm AAA TAT GTA ACA GCA ACC AGT 

£3£ S Pro S SSSS val Thr Al. Thr Ser 

165 170 
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CAT CAT ATT GTT GAG TGG GCG GCC AAA AAA GGT ATT CCT CTC ATC TTT 
SI Se Sal Su Trp Ala Ala Lys Lys Gly He Pro Leu He Ph/ 

180 I 85 

air Tvjp GAT gat tct AAT GAT GTT AGA TAT GAA TAT GCT GAA AG^/TAT 
Ss m Sp Sp Sr £n Asp Val Arg -Tyr Glu Tyr Ala Glu arf Tyr 
195 200 20i 



AAA GCC GTT GCG GAT AAA TAT GAC GTT GAC CTA TCA GAG 
J££ Sa Val Ala Asp Lys Tyr Asp Val Asp Leu Ser Glu, 
210 215 220 / 

CAG TTA ATG ATA TTA GTT AAC TAT AAC GAA GAT AGT/^AT 
Gin Leu Met He Leu Val Asn Tyr Asn Glu Asp Se^ Asn 
225 230 235 



CAA GAG ACG CGT GCA TTT ATT AGT GAT TAT G' 
Gin Glu Thr Arg Ala Phe He Ser Asp Tyr 
245 250, 



CTT GAA 
il Leu Glu 



TA GAC CAT 
[le Asp His 



AAA GCT AAA. 
Lys Ala Lys 
240 

ATG CAC CCT 
Met His Pro 
255 



AAT GAA AAT TTC GAA AAT AAA CTT GAA 
Asn Glu Asn Phe Glu Asn Lys Leu 
260 

GTC GGA AAT TAT ACG GAG TG* 
Val Gly Asn Tyr Thr Glu Cy 
275 




ATA ATT GCA GAA AAC GCT 
He He Ala Glu Asn Ala 
270 



.ACT GCG GCT AAG TTG GCA ATT GAA 
/Thr Ala Ala Lys Leu Ala He Glu 

f " 285 



AAG TGT GGT GCG AAA AGT GT^TTG\C 
Lys Cys Gly Ala Lys Ser V 
290 295 



Leu 



PG TCC TTT GAA CCA ATG AAT GAT 
»u Ser Phe Glu Pro Met Asn Asp 
300 



TTG ATG AGC CAA AAA 
Leu Met Ser Gin Lys 
305 

AAG TAC CAC ATG 
Lys Tyr His Met 



GTA ATC AAT ATT GTT GAT GAT AAT ATT AAG 
Val He Asn He Val Asp Asp Asn He Lys 

fio 315 320 

TAT ACC TAATAGATTT CGAGTTGCAG CGAGGCGGCA 
Tyr Thr 



ACTGAACGAA TCC^AGGAG CATAGATAAC TA1CTGACTC GGGTGAGTGA AAGCAGCCAA 
CAAAGCAGCA ffi AAG GGT ATA AAA go « GAC AGC ACT GOT 

1 5 



GCC ATA CI 
Ala lie Le 



TCT AAT ATT ATC TTG AGG AGT AAA ACA GGT ATG ACT TCA 
Sr As*n He lie Leu Arg Ser Lys Thr Gly Met Thr Ser 
15 20 25 

Tifel K §£ SS £ £ £ S 2SSSS 2S S 

30 35 

rT TCG AGC GAT CCA TTA GTG TGG TCT TAC GAC GAG CAG GAA AAA 
ie Ser Ser Asp Pro Leu Val Trp Ser Tyr Asp Glu Gin Glu Lys 

50 55 



ATT 
He 



atp/aGA AAG AAA CTT GTG CTT GAT GCA TTT CGT AAT CAT TAT AAA CAT 
S? JS S JJi Leu Val Leu Asp Ala Phe Arg Asn Hi. Tyr Lys His 

Jct CGA GAA TAT CGT CAC TAC TGT CAG GCA CAC AAA GTA GAT GAC AAT 
25 ™ Arg His Tyr Cys Gin Ala His Lys Val Asp Asp Asn 

80 85 



7741 

7789 

7837 

7885 

7933 

7981 

8029 

8077 

8125 

8176 

8236 
8288 

8336 

8384 

8432 

8480 

8528 



WO 99/25866 



PCT/FI98/00873 



36 



. nvp , rr rAA ATT gat GAC ATA CCT GTA TTC CCA ACA TCG GTT TTT AAG 
S i£ Si 52 S Sp He Pro Val Phe Pro Thr Ser Val Phe Lys 

95 100 AUJ ' 

TTT ACT CGC TTA TTA ACT TCT CAG GAA AAC GAG ATT GAA AGT TGG 
J£ T*£ Arg Leu Leu Thr Ser Gin Glu Asn Glu He Glu Ser Trp, 
no 115 

arr APT AGC GGC ACG AAT GGT TTA AAA AGT CAG GTG GCG CGt/gAC AGA 
5£ Jer Ser SS Asn Gly Leu Lys Ser Gin Val Ala M Asp Arg 

125 130 135 



Phe 



TTA AGT ATT GAG AGA CTC TTA GGC TCT GTG AGT TAT G< 
Leu Ser He Glu Arg Leu Leu Gly Ser Val Ser Tyr 
140 145 i50 



ATG AAA TAT 
Ly Met Lys Tyr 
155 



GTT GGT AGT TGG TTT GAT CAT CAA ATA GAA TTA 
Val Gly Ser Trp Phe Asp His Gin He Glu Ley J 
160 I 65 



„rC AAT TTG GGA CCA 
/al Asn Leu Gly Pro 
170 



GAT AGA TTT AAT GCT CAT AAT ATT TGG 
Asp Arg Phe Asn Ala His Asn He Trp Pr 
175 I 80 



f TAT GTT ATG AGT TTG 

'*Lys Tyr Val Met Ser Leu 
185 



GTG GAA TTG TTA TAT CCT ACG 
Val Glu Leu Leu Tyr Pro Thaf Thr 
190 

GAT TTT GTT AAA ACA TTG / AAT 
Asp Phe Val Lys Thr Leu \ Asn 
205 210, 



rCTT 



AAA GAT CTT TGT CTT ATT 
Lys Asp Leu Cys Leu He 
220 225 ^ 

CAT TAT ATG AAA GAT 
His Tyr Met Lys Asp 
240/ 



2T TCG CC 



'ACC GTA ACA GAA GAA CGA ATA 
Thr Val Thr Glu Glu Arg He 
200 

CGA ATA AAA AAT CAA GGG 
Arg He Lys Asn Gin Gly 
215 

TAC TTT ATT TAT TTA CTC TGC 
Tyr Phe He Tyr Leu Leu Cys 
230 235 



AAA ATC TCA TTT TCT GGA GAT AAA AGC CTT 
Lys He Ser Phe Ser Gly Asp Lys Ser Leu 
245 250 



TAT ATC ATA ACC 
Tyr He He Thr 
255 

AAA CGT GAT 
Lys Arg Asp 
270 



GGC GGC TGG AAA AGT TAC GAA AAA GAA TCT CTG 
Gly Gly Trp Lys Ser Tyr Glu Lys Glu Ser Leu 
260 265 

'tTC AAT CAT CTT TTA TTT GAT ACT TTC AAT CTC AGT 
Phe Asn His Leu Leu Phe Asp Thr Phe Asn Leu Ser 
275 280 



GAT ATT AGT/CAG ATC CGA GAT ATA TTT AAT CAA GTT GAA CTC AAC ACT 
Sp S te/Gln He Arg Asp He Phe Asn Gin Val Glu Leu Asn Thr 
285 / 290 295 



TGT TTC 
Cys Phey 
300 



GAG GAT GAA ATG CAG CGT AAA CAT GTT CCG CCG TGG GTA 
Phe SS HI Glu Met Gin Arg Lys His Val Pro Pro Trp Val 
305 310 



TAT g/g CGA GCG CTT GAT CCT GAA ACG TTG AAA CCT GTA CCT GAT GGA 
fil S 2a Su Asp Pro Glu Thr Leu Lys Pro Val Pro Asp Gly 
320 325 

CCG GGG TTG ATG AGT TAT ATG GAT GCG TCA GCA ACC AGT TAT CCA 
Pro Gly Leu Met Ser Tyr Met Asp Ala Ser Ala Thr Ser Tyr Pro 
335 340 

'gCA TTT ATT GTT ACC GAT GAT GTC GGG ATA ATT AGC AGA GAA TAT GGT 
S 52 He Val Thr Asp Asp Val Gly He He Ser Arg Glu Tyr Gly 
350 355 



AC 



8576 



8624 



8672 



8720 



8768 



8816 



8864 



8912 



8960 



9008 



9056 



9104 



9152 



9200 



9248 



9296 



9344 
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H ! 



AAG TAT CCC GGC GTG CTC GTT GAA ATT TTA CGT 
Lvs Tvr Pro Gly Val Leu Val Glu lie Leu Arg 
365 370 

ACG CAG AAA GGG TGT GCT TTA AGC TTA ACC GAA 
Thr Gin Lys Gly Cys Ala Leu Ser Leu Thr Glu 
380 385 390 

TGATATCCTT TGCCTAATTG TAAGTGGAAT GCTTGCGTTA 

ACACTTTACA AAATTCTCCA AAACATCCAC ATTTGGGTAC 

TGGCTTAACA TTGTTCTCAT TGTTATTATT GGCTCAAAGC 

AATTGGCAGT TATGCTTGCA TTGGGAATGA TTAGCTTTGG 

ATAAAGATGC AAAGTTTGGC ATGACAGAAG AAGAGTTTC 

TTGAAAAATT TGAGGGAGAT TCTCGAATAG AAGAAGT£TC 

TTTCGTTTGC TAACAAAAAG CGTGAAGCAA TGGCAJ 

GATTAGAGAT TAATATTGGC AGACTTGTGA AGOSAGTAAG 

ACGGAGATGG ATCATCGTAT CCATCAAAAG AAGAATTTGA 



CGC GTC AAT ACG AG 
Arg Val Asn Thr 
375 



GCG TTT GAT 
Ala Phe Asp 



Ser 



TATAAATCJG 



GAACTATGTC TATAGGTTAT GAT^TA^T£ 
GCAAAGAAGG CATAGAAACC AG f 
CAGATTTCGG AGAAAAAATC CAC 
GGGGATCCCT CGAGGTCGAC CTGC^GGC* 
ATA 



rAA 



CGGTATTAGT 
^ATACGAG 

^g^gattaaa 
:ttggcgt 



TTGAT^GAGG 
lGAT 

T0CAATGGCA 
TTCGAAGAGG 
ACTTTATTCA 
TTTAAATGGG 
CAAATCGTTA 
GAACGCGCTA 
TGATATACAT 
TCCAGATGTT 
GGAATTCGAG 
CACCCGCAGT 



AATGACATCT 
TTTATGGGGT 
AACATGAAAA 
GTTGATGGGT 
TTATGTGATT 
TGTTCTGACT 
AAATTTAAAA 
ACGAAAAAGT 
AAATACAATG 
ATAATATGTG 
TATACGCTCC 
CTCGGTACCC 
TCGGTGGTTA 



9392 
9437 

9497 
9557 
9617 
9677 
9737 
9797 
9857 
9917 
9977 
10037 
10097 
10157 
10217 
10220 



(2) INFORMATION FOR SBQ 1° NO: 4: 

(i) SEQUENCE ^CHARACTERISTICS: 

(A) LENGTH: 483 amino acids 

(B) TYPE/; amino acid 
( D ) TOPOLOGY : 1 inear 



(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Ala Asn Met Thr Lys Lys lie Ser Phe lie lie Asn Gly Gin Val 



10 



Glu lie Ph/pro Glu Ser Asp Asp Leu Val Gin Ser He Asn Phe Gly 
r 20 25 



Asp Asn Ser 



Val Tyr Leu Pro He Leu Asn Asp Ser His Val Lys Asn 



35 40 
lie il/ Asp Cys Asn Gly Asn Asn Glu Leu Arg Leu His Asn lie Val 



55 



Asn Phe 



Leu Tyr Thr Val Gly Gin Arg Trp Lys Asn Glu Glu Tyr Ser 



65 / 70 75 



Arst Arg Arg Thr Tyr He Arg Asp Leu Lys Lys Tyr Met Gly Tyr Ser 

85 90 " 

/lu Glu Met Ala Lys Leu Glu Ala Asn Trp lie Ser Met lie Leu Cys 



100 



105 
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Ser Lys Gly Gly Leu Tyr Asp Val Val Glu Asn Glu Leu Gly Ser Arg 
115 120 125 / 

His lie Met Asp Glu Trp Leu Pro Gin Asp Glu Ser Tyr Val Arg Ala 
130 135 140 / 

Phe Pro Lys Gly Lys Ser Val His Leu Leu Ala Gly Asn vA Pro Leu 
145 150 155 / 

Ser Gly He Met Ser He Leu Arg Ala He Leu Thr LVfe Asn Gin Cys 
165 170 / " 175 

He He Lys Thr Ser Ser Thr Asp Pro Phe Thr Ala Asn Ala Leu Ala 
180 185 / 190 

Leu Ser Phe He Asp Val Asp Pro Asn His Pr/lle Thr Arg Ser Leu 
195 200 / 205 

Ser Val He Tyr Trp Pro His Gin Gly Asn/rhr Ser Leu Ala Lys Glu 
210 215 / 220 

He Met Arg His Ala Asp Val He Val /la Trp Gly Gly Pro Asp Ala 
225 230 / 235 240 

He Asn Trp Ala Val Glu His Ala Pro Ser Tyr Ala Asp Val lie Lys 
245 / 250 255 



Phe Gly Ser Lys Lys Ser 
260 

Thr Ser Ala Ala Thr Gly AX 
275 




APs 



He Asp Asn Pro Val Asp Leu 
270 

Asp Val Cys Phe Tyr Asp Gin 
285 



Arg Ala Cys Phe Ser Ala /6ln Asn He Tyr Tyr Met Gly Asn His Tyr 
290 /295 300 

Glu Glu Phe Lys Leu Ma Leu He Glu Lys Leu Asn Leu Tyr Ala His 
305 XlO 315 320 

He Leu Pro Asn Al/ Lys Lys Asp Phe Asp Glu Lys Ala Ala Tyr Ser 

330 335 

Leu Val Gin Lys din Ser Leu Phe Ala Gly Leu Lys Val Glu Val Asp 
340/ 345 350 

He His Gin pJg Trp Met He He Glu Ser Asn Ala Gly Val Glu Phe 
355 7 360 365 

Asn Gin ProAeu Gly Arg Cys Val Tyr Leu His His Val Asp Asn He 
370 / 375 380 

Glu Gin lie Leu Pro Tyr Val Gin Lys Asn Lys Thr Gin Thr He Ser 
385 / 390 395 400 

He Phe/Pro Trp Glu Ser Ser Phe Lys Tyr Arg Asp Ala Leu Ala Leu 
1 405 410 415 

Lys g/v Ala Glu Arg He Val Glu Ala Gly Met Asn Asn lie Phe Arg 
f 420 425 430 

Val /Gly Gly Ser His Asp Gly Met Arg Pro Leu Gin Arg Leu Val Thr 
1 435 440 445 

Ty/ He Ser His Glu Arg Pro Ser Asn Tyr Thr Ala Lys Asp Val Ala 
r 450 455 460 

Jal Glu He Glu Gin Thr Arg Phe Leu Glu Glu Asp Lys Phe Leu Val 
i65 470 475 480 



WO 99/25866 



39 



PCT/FI98/00873 



Phe Val Pro 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 307 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 

<ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Glu Asn Glu Ser Lys Tyr Lys Thr lie Asp His Val/ lie Cys Val 
1 5 10/15 

Glu Gly Asn Lys Lys lie His Val Trp Glu Thr Leu/Pro Glu Glu Asn 
20 25 / 30 

Ser Pro Lys Arg Lys Asn Ala He He He Ala Aer Gly Phe Ala Arg 
35 40 / 45 

Arg Met Asp His Phe Ala Gly Leu Ala Glu T#r Leu Ser Arg Asn Gly 
50 55 / 60 

Phe His Val He Arg Tyr Asp Ser Leu h/s His Val Gly Leu Ser Ser 
65 70 / 75 80 

Glv Thr He Asp Glu Phe Thr Met Se/ He Gly Lys Gin Ser Leu Leu 
85 / 90 95 



Ala Val Val Asp Trp Leu Tpx 
100 

Leu Ala Ser Ser Leu Ser Aria 

115 /120N 



^g Lys He Asn Asn Phe Gly Met 
\5 HO 

lie Ala Tyr Ala Ser Leu Ser Glu 
125 



He Asn Ala Ser Phe Leu i/e Thr Ala Val Gly Val Val Asn Leu Arg 
130 *35 140 

Tyr Ser Leu Glu Arg Ala 7 Leu Gly Phe Asp Tyr Leu Ser Leu Pro He 
145 i?b 155 160 

Asn Glu Leu Pro Asp Asn Leu Asp Phe Glu Gly His Lys Leu Gly Ala 
\Sf 170 175 

Glu Val Phe Ala Arg Asp Cys Leu Asp Phe Gly Trp Glu Asp Leu Ala 
180 / 185 190 

Ser Thr He As/ Asn Met Met Tyr Leu Asp He Pro Phe He Ala Phe 
195 / 200 205 

Thr Ala Asn Asn Asp Asn Trp Val Lys Gin Asp Glu Val He Thr Leu 
210 / 215 220 

Leu Ser A^n He Arg Ser Asn Arg Cys Lys He Tyr Ser Leu Leu Gly 
225 / 230 235 240 

Ser Ser/His Asp Leu Ser Glu Asn Leu Val Val Leu Arg Asn Phe Tyr 
245 250 255 

Gin Sjer Val Thr Lys Ala Ala He Ala Met Asp Asn Asp His Leu Asp 
260 265 270 

He/Asp Val Asp He Thr Glu Pro Ser Phe Glu His Leu Thr He Ala 
7 275 280 285 
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Thr Val Asn Glu Arg Arg Met Arg He Glu He Glu Asn Gin Ala lie, 
290 295 300 

Ser Leu Ser 
305 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 360 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Lys Phe Gly Asn Phe Leu Leu Thr Tyr Gl/pro Pro Gin Phe Ser 
1 5 10 / 

Gin Thr Glu Val Met Lys Arg Leu Val Ly/ Leu Gly Arg lie Ser Glu 
20 25 7 JU 

Phe Asp Thr Val Trp Leu/Leu Glu His His Phe Thr Glu 



Glu Cys Gly Phe Asp 
35 



40 



45 



Phe Gly Leu Leu Gly Asn Pro Tyr/al Ala Ala Ala Tyr Leu Leu Gly 

55 / 6° 

*ly Thr Ala Ala He Val Leu Pro Thr 



50 



As^Tva^ 



Ala Thr Lys Lys Leu 
65 flO 

Ala His Pro Val Arg Gin 
85 

Ser Lys Gly Arg Phe 
100 



75 



80 



Asp Val Asn Leu Leu Asp Gin Met 
90 95 



Phe Gly He Cys Arg Gly Leu Tyr Asn Lys 
105 HO 



Met 



Asp Phe Arg Val Phe/Cly Thr Asp Met Asn Asn Ser Arg Ala Leu Ala 
115 / 120 1 

Glu Cys Trp TVr cLy Leu He Lys Asn Gly Met Thr Glu Gly Tyr 
130 / 135 

Glu His He Lys Phe His Lys Val Lys Val Asn Pro 
150 155 160 

Ala Ala Tyr ier Arg Gly Gly Ala Pro Val Tyr Val Val Ala Glu Ser 



Glu Ala Asp 
145 



165 



170 



Ala Ser T^r Thr Glu Trp Ala Ala Gin Phe Gly Leu Pro Met He Leu 



180 



185 



Ser Trp/lle He Asn Thr Asn Glu Lys Lys Ala Gin Leu Glu Leu Tyr 

200 205 



195 



Asn Glu Val Ala Gin Glu Tyr Gly His Asp He His Asn He Asp His 
110 2 15 220 

Cy/ Leu Ser Tyr He Thr Ser Val Asp His Asp Ser He Lys Ala Lys 



225 



230 



/lu He Cys Arg Lys Phe Leu Gly His Trp Tyr Asp Ser Tyr Val Asn 



245 



250 



Ala Thr Thr He Phe Asp Asp Ser Asp Gin Thr Arg Gly Tyr Asp Phe 

265 2/0 



260 
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Asn Lys 



Gly Gin Trp Arg Asp Phe Val Leu Lys Gly His Lys Asp Thr, 



Asn Arg 
290 



275 280 

Arg He Asp Tyr Ser Tyr Glu He Asn Pro Val Gly Thr 



295 300 
Gin Glu Cys He Asp lie lie Gin Lys Asp lie Asp Ala Thr lie 



305 310 

He Cys Cys Gly Phe Glu Ala Asn Gly Thr Val A/p Glu He 



Ser Asn 



325 



330 



lie Ala Ser Met Lys Leu Phe Gin Ser Asp Val Met ?fo Phe Leu Lys 



340 

Glu Lys Gin Arg Ser Leu Leu Tyr 
355 360 



345 



fin 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 327 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: £EQ ID NO: 7; 



Met Lys Phe Gly Leu Phe 
1 5 

Gin Glu Gin Ser He Val Arc? 

20 7 



Me\ 



.Asn Phe He Asn Ser 
10 

kin Glu He Thr Glu 
25 



Lys Leu Asn Phe Glu Gln/le Leu Val Tyr Glu Asn His 
35 / 40 

Asn Gly Val Val Gly /a Pro Leu Thr Val Ser Gly Phe 
50 / 55 

Le u Thr Glu Lys Il/Lys lie Gly Ser Leu Asn His He 
65 / 70 75 



His His Pro Val Ala He Ala Glu Glu Ala Cys Leu Leu 
/ 85 90 

Ser Glu Gly Jg Phe He Leu Gly Phe Ser Asp Cys Glu 

yoo 105 

Glu Met His/Phe Phe Asn Arg Pro Val Glu Tyr Gin Gin 

Glu Glu Js Tyr Glu He He Asn Asp Ala Leu Thr Thr 

130 7 135 
Asn Pro Lp Asn Asp Phe Tyr Ser Phe Pro Lys He Ser 
145 / 3.50 155 

His All Tyr Thr Pro Gly Gly Pro Arg Lys Tyr Val Thr 
T 165 170 

His i/is He Val Glu Trp Ala Ala Lys Lys Gly He Pro 
/ 180 3.85 

Lys/Trp Asp Asp Ser Asn Asp Val Arg Tyr Glu Tyr Ala 

/ 195 200 



Thr Thr Val 
15 

Tyr Val Asp 
30 

Phe Ser Asp 

Leu Leu Gly 

He Thr Thr 
80 

Asp Gin Leu 
95 



Lys Lys Asp 
110 

Gin Leu Phe 

Gly Tyr Cys 

Val Asn Pro 
160 

Ala Thr Ser 
175 

Leu He Phe 
190 

Glu Arg Tyr 
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Lys Ala val Ala Asp Lys Tyr Asp Val Asp Leu Ser Glu lie Asp 
210 215 220 

Gin Leu Met lie Leu Val Asn Tyr Asn Glu Asp Ser Asn Lys Ala^ys 
225 230 ^ b 

Gin Glu Thr Arg Ala Phe lie Ser Asp Tyr Val Leu Glu Met H/s Pro 
245 250 

Asn Glu Asn Phe Glu Asn Lys Leu Glu Glu lie He Ala G*d Asn Ala 
260 265 



val Gly Asn Tyr Thr Glu Cys He Thr Ala Ala Lys W Ala lie Glu 

275 280 r 5 

Lys Cys Gly Ala Lys Ser Val Leu Leu Ser Phe Glu/pro Met Asn Asp 



290 295 
Leu Met Ser Gin Lys Asn Val lie Asn He ValAsp Asp Asn lie Lys 
305 310 33* 



Lys Tyr His Met Glu Tyr Thr 
325 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTIC 

(A) LENGTH: 394 ^amino acids 

(B) TYPE: amir 
(D) TOPOLOGY/ 

(ii) MOLECULE TYH . 
(xi) SEQUENCE DESCRIPsfc^: SfeQ ID NO: 8: 

Met Lys Gly He Lys Glu Ty/V Ser S % A1 * Ala 116 M *S *" 

1 5 / - LU 

He lie Leu Arg Ser Lys/hr Gly Met Thr Ser Tyr Val Asp Lys Gin 

20 / 25 

Glu lie Thr Ala Ser s4 Glu He Asp Asp Leu He Phe Ser Ser Asp 



Pro Leu 
50 



35 / 40 45 

Val Trp Se/ Tyr Asp Glu Gin Glu Lys lie Arg Lys Lys Leu 

' —- 60 



55 



Val Leu Asp Ala/Phe Arg Asn His Tyr Lys His Cys Arg Glu Tyr Arg 
65 7 70 75 

His Tyr Cys tsL Ala His Lys Val Asp Asp Asn He Thr Glu He Asp 

/ 85 yu 
Asp lie Pro/val Phe Pro Thr Ser Val Phe Lys Phe Thr Arg Leu Leu 



100 i° 5 

A r-T-. Cl~\ 11 Tip 

120 125 



Thr Ser G /n Glu Asn Glu He Glu Ser Trp Phe Thr Ser Ser Gly Thr 



Asn Gly/Leu Lys Ser Gin Val Ala Arg Asp Arg Leu Ser lie Glu Arg 
13(v 3-35 ■ Lflu 



Leu Le'u Gly Ser Val Ser Tyr Gly Met Lys Tyr Val Gly Ser Trp Phe 
145 / 150 

Asp His Gin He Glu Leu Val Asn Leu Gly Pro Asp Arg Phe Asn Ala 
r r lg5 170 
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His Asn lie Trp Phe Lys Tyr Val Met Ser Leu Val Glu Leu Leu/Tyr 
180 185 190 / 

Pro Thr Thr Phe Thr Val Thr Glu Glu Arg lie Asp Phe Val/Lys Thr 
195 200 205 / 

Leu Asn Ser Leu Glu Arg lie Lys Asn Gin Gly Lys Asp/Leu Cys Leu 
210 215 220 7 

lie Gly Ser Pro Tyr Phe He Tyr Leu Leu Cys His >fyr "Met Lys Asp 
225 230 235 / 240 

Lys Lys He Ser Phe Ser Gly Asp Lys Ser Leu for He He Thr Gly 
245 250 / 255 

Gly Gly Trp Lys Ser Tyr Glu Lys Glu Ser Leu Lys Arg Asp Asp Phe 
260 265 / 270 

Asn His Leu Leu Phe Asp Thr Phe Asn Lea Ser Asp He Ser Gin He 
275 280 / 285 

Arg Asp He Phe Asn Gin Val Glu Leu 7 Asn Thr Cys Phe Phe Glu Asp 
290 295 / 300 

Glu Met Gin Arg Lys His Val Pro/Pro Trp Val Tyr Ala Arg Ala Leu 
305 310 / 315 320 

Asp Pro Glu Thr Leu Lys Pro Val Pro Asp Gly Thr Pro Gly Leu Met 
325 r\ / 330 335 



Ser Tyr Met Asp Ala S€ 
340 

Asp Asp Val Gly He Hey 
355 



Sei 



Thr Ser Tyr Pro Ala Phe He Val Thr 
345 350 

eg Glu Tyr Gly Lys Tyr Pro Gly Val 
50 365 



Leu Val Glu He Leu Arg Arg Val Asn Thr Arg Thr Gin Lys Gly Cys 
370 / 375 380 

Ala Leu Ser Leu Th/ Glu Ala Phe Asp Ser 
385 / 390 

(2) INFORMATION y£oR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS : 

(A) /LENGTH: 3098 base pairs 

(B) /TYPE: nucleic acid 
<c/ STRANDEDNESS : double 
{Dp TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: pASK75 

(viii)/ POSITION IN GENOME: 

(A) CHROMOSOME /SEGMENT: vector 



(i>/) FEATURE: 

(A) NAME /KEY : promoter 

(B) LOCATION: 542. .672 

(D) OTHER INFORMATION: /function= "beta-la promoter 1 " 
/label= beta-la 
/citation= ([1]) 
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(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 673 . .1530 

(D) OTHER INFORMATION: /product = "beta-la" 
/citation= ([1]) 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 1543. .2163 

(D) OTHER INFORMATION: /product = "tetR" 
/citation^ ([1]) 

(ix) FEATURE: 

(A) NAME/ KEY: misc_f eature 

(B) LOCATION:2713 . .2950 
(D) OTHER INFORMATION: /function= 

/label= ORI 
/citations ([1]) 

(ix) FEATURE: 

(A) NAME/ KEY: promoter 

(B) LOCATION: 297 6. .3073 

(D) OTHER INFORMATION: / f un<^cion= "p tetA promoter 
/citation= 



.1) 



(x) PUBLICATION INFORJX&TI0N: 

(A) AUTHORS: Ske 

(B) TITLE: Use o 

tightly r£guj 
fragment in 

(C) JOURNAL: Gene 

(D) VOLUME: 151 

(E) ISSUE: 1-2 

(F) PAGES: 131-^35 

(G) DATE: 30-IJEC-1994 
(K) RELEVANT 



^racycline promoter for the 

) product ion of a murine antibody 

tichia coli 



ESIDUES IN SEQ ID NO: 9: FROM 1 TO 3098 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9: 

AGCTTGACCT GTGAAGTfeAA AAATGGCGCA CATTGTGCGA CATTTTTTTT GTCTGCCGTT 

TACCGCTACT GCGTC&CGGA TCTCCACGCG CCCTGTAGCG GCGCATTAAG CGCGGCGGGT 

GTGGTGGTTA CG9GCAGCGT GACCGCTACA CTTGCCAGCG CCCTAGCGCC CGCTCCTTTC 

GCTTTCTTCC O^TCCTTTCT CGCCACGTTC GCCGGCTTTC CCCGTCAAGC TCTAAATCGG 

GGGCTCCCTl/ TAGGGTTCCG ATTTAGTGCT TTACGGCACC TCGACCCCAA AAAACTTGAT 

TAGGGTGA^G GTTCACGTAG TGGGCCATCG CCCTGATAGA CGGTTTTTCG CCCTTTGACG 

TTGGAGTCCA CGTTCTTTAA TAGTGGACTC TTGTTCCAAA CTGGAACAAC ACTCAACCCT 

ATCTC^GTCT ATTCTTTTGA TTTATAAGGG ATTTTGCCGA TTTCGGCCTA TTGGTTAAAA 

AATGAGCTGA TTTAACAAAA ATTTAACGCG AATTTTAACA AAATATTAAC GCTTACAATT 

TCAfeGTGGCA CTTTTCGGGG AAATGTGCGC GGAACCCCTA TTTGTTTATT TTTCTAAATA 

C^TCAAATA TGTATCCGCT CATGAGACAA TAACCCTGAT AAATGCTTCA ATAATATTGA 

W3GAAGA GT ATG AGT ATT CAA CAT TTC CGT GTC GCC CTT ATT CCC 
Met Ser lie Gin His Phe Arg Val Ala Leu He Pro 
395 400 405 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
708 
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ttt TTT GCG GCA TTT TGC CTT CCT GTT TTT GCT CAC CCA GAA ACG CTG 
J£ Te Sa Si Cys Leu Pro Val Phe Ala His Pro Glu Thr Leu 

410 415 

CTG AAA GTA AAA GAT GCT GAA GAT CAG TTG GGT GCA CGA GTG GGT 
?S J£i S2 Ss Asp Ala Glu Asp Gin Leu Gly Ala Arg Val Glv/Tyr 
425 430 



'CGC CCC 
Arg Pro 



ATC GAA CTG GAT CTC AAC AGC GGT AAG ATC CTT GAG AGT 
He Glu Leu Asp Leu Asn Ser Gly Lys He Leu Glu Ser 
440 445 450 

GAA GAA CGT TTT CCA ATG ATG AGC ACT TTT AAA GTT C?6 CTA TGT GGC 
SS SJ Arg Phe Pro Met Met Ser Thr Phe Lys Val 7eu Leu Cys Gly 
455 4 ^0 465 7 



GCG GTA TTA TCC CGT ATT GAC GCC GGG CAA GAG 
Ala Val Leu Ser Arg He Asp Ala Gly Gin Glu, 
475 4 80 



CTC GGT CGC CGC 
3ln Leu Gly Arg Arg 
485 



ATA CAC TAT TCT CAG AAT GAC TTG GTT GAG /AC TCA CCA GTC ACA GAA 
?2 SS Tyr Ser Gin Asn Asp Leu Val Glu/ryr Ser Pro Val Thr Glu 
490 4 95 / ^ uu 

AAG CAT CTT ACG GAT GGC ATG ACA GTa/gA GAA TTA TGC AGT GCT GCC 
Ss Leu Thr Asp Gly Met Thr Va/Arg Glu Leu Cys Ser Ala Ala 
505 510 7 



ATA ACC ATG AGT GAT AAC 
He Thr Met Ser Asp Asn Tl 
520 52 

GGA GGA CCG AAG GAG CTA ACC, 
Gly Gly Pro Lys Glu Leu Thi 
535 540 



icC AAC TTA CTT CTG ACA ACG ATC 
\Asn Leu Leu Leu Thr Thr He 
530 

TTG CAC AAC ATG GGG GAT CAT 
Leu His Asn Met Gly Asp His 
545 550 



GTA ACT CGC CTT GAT CGT/TGG GAA CCG GAG CTG AAT GAA GCC ATA CCA 
S£ 5£ £S Leu Asp Ar/ Trp Glu Pro Glu Leu Asn Glu Ala He Pro 
555 / 560 

AAC GAC GAG CGT GAc/cC ACG ATG CCT GTA GCA ATG GCA ACA ACG TTG 
Asn Asp Glu Arg Aso/Thr Thr Met Pro Val Ala Met Ala Thr Thr Leu 
570 / 575 

GGC GAA CTA CTT ACT CTA GCT TCC CGG CAA CAA 
Gly Glu Leu Leu Thr Leu Ala Ser Arg Gin Gin 
590 595 



CGC AAA CTA TTA 
Arg Lys Leu Leu> 
585 



TTG ATA GAC 
Leu He Asp 
600 

TCG GCC CI 
Ser Ala 
615 

GAG CGT/ GGC 
Glu Ari Gly 



lH3 ATG GAG GCG GAT AAA GTT GCA GGA CCA CTT CTG CGC 
fep Met Glu Ala Asp Lys Val Ala Gly Pro Leu Leu Arg 
605 610 

CCG GCT GGC TGG TTT ATT GCT GAT AAA TCT GGA GCC GGT 
Pro Ala Gly Trp Phe He Ala Asp Lys Ser Gly Ala Gly 
620 625 bJU 

TCT CGC GGT ATC ATT GCA GCA CTG GGG CCA GAT GGT AAG 
Ser Arg Gly He He Ala Ala Leu Gly Pro Asp Gly Lys 
635 640 645 



_ pp ^ r CCT ATC GTA GTT ATC TAC ACG ACG GGG AGT CAG GCA ACT ATG 
So/Sr S S S2 SI He Tyr Thr Thr Gly Ser Gin Ala Thr Met 
650 65 5 bbU 

GAA CGA AAT AGA CAG ATC GCT GAG ATA GGT GCC TCA CTG ATT AAG 
3 p Arg Asn Arg Gin He Ala Glu He Gly Ala Ser Leu He Lys 
665 670 675 



756 



804 



852 



900 



948 



996 



1044 



1092 



1140 



1188 



1236 



1284 



1332 



1380 



1428 



1476 



1524 
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S S TC S S £ S E £ S 5i 

680 1 5 

AGC GCA TTA GAG CTG CTT AAT GAG GTC GGA ATC GAA/dGT TTA ACA 
Asn Ser Ala Leu Glu Leu Leu Asn Glu Val Gly He ^ ^ 25 

ACC CGT AAA CTC GCC CAG AAG CTA GGT GTA GAG Ca/ CCT ACA TTG TAT 
Thr Arg Lys Leu Ala Gin Lys Leu Gly Val Glu G& Pro Thr Leu Tyr 
30 35 / 

»^ ..r rrr nrr TTG CTC G?£ GCC TTA GCC ATT GAG 
TGG CAT GTA AAA AAT AAG CGG GCT ill* l.i<- «v- ^ 

Trp His Val Lys Asn Lys Arg Ala Leu Leu/sp Ala Leu Ala lie uiu 

SS S SS S s ss S S5 Zfe S S IS S S5 5E 

60 65 ' 

TGG CAA GAT TTT TTA CGT AAT AAC AAA ACT TTT AGA TGT GCT TTA 

S 35 Sp Se Leu Arg Asn As/Xla Lys Ser Phe Arg Cys Ala Leu 

^5 * 1 u u 



t/CTC GAA AAT CAA TTA GCC TTT TTA TGC CAA 
Leu Glu Asn Gin Leu Ala Phe Leu Cys Gin 
115 120 

GCA TTA TAT GCA CTC AGC GCA GTG GGG 
Ala Leu Tyr Ala Leu Ser Ala Val Gly 
130 135 

CAT TTT ACT TTA (fcT TGC GTA TTG GAA GAT CAA GAG CAT CAA GTC GCT 
Hil Phe S ££y£y Cys Val Leu Glu Asp Gin Glu His Gin Val Ala 



GAA AAA CAG TAT GAA VAC 
Glu Lys Gin Tyr Glx/ Th: 
110 

CAA GGT TTT TCA 
Gin Gly Phe Ser 
125 




AAA GAA GAA 
Lys Glu Glu 
155 



GAA ACA CCT ACT ACT GAT AGT ATG CCG CCA TTA TTA 
? g g£ tS Pro Thr Thr Asp Ser Met Pro Pro Leu Leu 
160 165 



s a; $ s ss s s ss ss £j s s as s s ss 

190 195 
TGT GAA AGT GGG TCT TAAAAGCAGC ATAACCTTTT TCCGTGATGG TAACTTCACT 
Cys GfLu Ser Gly Ser 
205 

AGT^TAAAAG GATCTAGGTG AAGATCCTTT TTGATAATCT CATGACCAAA ATCCCTTAAC 
gtIagttttc GTTCCACTGA gcgtcagacc CCGTAGAAAA GATCAAAGGA TCTTCTTGAG 
rCCTTTTTT TCTGCGCGTA ATCTGCTGCT TGCAAACAAA AAAACCACCG CTACCAGCGG 
rGGTTTGTTT GCCGGATCAA GAGCTACCAA CTCTTTTTCC GAAGGTAACT GGCTTCAGCA 
/gagcgcagat ACCAAATACT GTCCTTCTAG TGTAGCCGTA gttaggccac cacttcaaga 
ACTCTGTAGC ACCGCCTACA TACCTCGCTC TGCTAATCCT GTTACCAGTG GCTGCTGCCA 



1572 

1620 

1668 

1716 

1764 

1812 

1860 

1908 

1956 

2004 

2052 

2100 

2148 

2203 

2263 
2323 
2383 
2443 
2503 
2563 
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GTGGCGATAA 

AGCGGTCGGG 

CCGAACTGAG 

AGGCGGACAG 

CAGGGGGAAA 

GTCGATTTTT 

CCTTTTTACG 

TGGCCAGATG 

CACTCCCTAT 



GTCGTGTCTT 
CTGAACGGGG 
ATACCTACAG 
GTATCCGGTA 
CGCCTGGTAT 
GTGATGCTCG 
GTTCCTGGCC 
ATTAATTCCT 
CAGTGATAGA 



ACCGGGTTGG 
GGTTCGTGCA 
CGTGAGCTAT 
AGCGGCAGGG 
CTTTATAGTC 
TCAGGGGGGC 
TTTTGCTGGC 
AATTTTTGTT 
GAAAAGTGAA 



ACTCAAGACG 
CACAGCCCAG 
GAGAAAGCGC 
TCGGAACAGG 
CTGTCGGGTT 
GGAGCCTATG 
CTTTTGCTCA 
GACACTCTAT 
ATGAATAG 



ATAGTTACCG 
CTTGGAGCGA 
CACGCTTCCC 
AGAGCGCACG 
TCGCCACC 

!GCC 
CATG&CCCGA 



CGACAAAAAT 



GATAAGGCGC, 

GAAQgGAGAA 
1AGCTTC 
TGACTTGAGC 
AGCAACGCGG 
CACCATCGAA 
GTTATTTTAC 
CTAGA 



(2) INFORMATION FOR SEQ ID NO: 10: 



(i) SEQUENCE 

(A) LENGTH : 286 

(B) TYPE: amino 
<D) TOPOLOGY: li 

(ii) MOLECULE TYPE: _ 
(xi) SEQUENCE DESCRIPTION: 



NO: 10: 



Met Ser He Gin His Phe Arg V&l Ala V Leu He Pro Phe Phe Ala Ala 
1 5/IO 

Phe Cys Leu Pro Val Phe His Pro Glu Thr Leu Val Lys Val Lys 
20 / 25 * U 

Asp Ala Glu Asp Gin Le/ciy Ala Arg Val Gly Tyr lie Glu Leu Asp 
35 / 40 43 

Leu Asn Ser Gly Lys/le Leu Glu Ser Phe Arg Pro Glu Glu Arg Phe 
50 / 55 60 

Pro Met Met Ser Thr Phe Lys Val Leu Leu Cys Gly Ala Val Leu Ser 
65 / 70 75 

Arg lie Asp Al/ Gly Gin Glu Gin Leu Gly Arg Arg lie His Tyr Ser 
1 85 90 

Gin Asn Asp/eu Val Glu Tyr Ser Pro Val Thr Glu Lys His Leu Thr 
'100 105 

Asp Gly uJ: Thr Val Arg Glu Leu Cys Ser Ala Ala lie Thr Met Ser 

- {5 120 

Asp Asn Lr Ala Ala Asn Leu Leu Leu Thr Thr lie Gly Gly Pro Lys 

130/ 135 1 

Glu Le/ T hr Ala Phe Leu His Asn Met Gly Asp His Val Thr Arg Leu 

145 / 150 

Asp Jrg Trp Glu Pro Glu Leu Asn Glu Ala He Pro Asn Asp Glu Arg 
^ 16 5 170 17b 

Asp/Thr Thr Met Pro Val Ala Met Ala Thr Thr Leu Arg Lys Leu Leu 
' 180 185 190 

-rJr Gly Glu Leu Leu Thr Leu Ala Ser Arg Gin Gin Leu He Asp Trp 
r y 195 200 205 



2623 

2683 

2743 

2803 

2863 

2923 

2983 

3043 

3098 
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Met Glu Ala Asp Lys Val Ala Gly Pro Leu Leu Arg Ser Ala Pro 
210 215 220 

Ala Gly Trp Phe lie Ala Asp Lys Ser Gly Ala Gly Glu^Krg Gly Ser 



225 



230 



Arg Gly He He Ala Ala Leu Gly Pro Asp Gly Lys^pfo Ser Arg He 



245 



Val Val He Tyr Thr Thr Gly Ser Gin Ala Thr y£t Asp Glu Arg Asn 



260 



265 



Arg Gin lie Ala Glu He Gly Ala Ser Leu ife Lys His Trp 



275 



280 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 207 amino ao^ids 

(B) TYPE: amino acid 
(D) TOPOLOGY: lin€S 



(ii) MOLECULE TYPE: p£ , 
(xi) SEQUENCE DESCRIBTTIC 

Met Ser Arg Leu Asp Lys Sej 
1 5 7 



'SEQ^ID NO: 11: 



He Asn Ser Ala Leu Glu Leu 
10 15 



Leu Asn Glu Val Gly lie Au Gly Leu Thr Thr Arg Lys Leu Ala Gin 
20 / 25 JU 

Lys Leu Gly Val Glu o/n Pro Thr Leu Tyr Trp His Val Lys Asn Lys 
35 7 40 45 

Arg Ala Leu Leu As/ Ala Leu Ala He Glu Met Leu Asp Arg His His 
50 / 55 & u 

Thr His Phe Cys /ro Leu Glu Gly Glu Ser Trp Gin Asp Phe Leu Arg 
65 / ^O 75 

Asn Asn Ala Ly4 Ser Phe Arg Cys Ala Leu Leu Ser His Arg Asp Gly 
7 85 90 « 

Ala Lys val/6is Leu Gly Thr Arg Pro Thr Glu Lys Gin Tyr Glu Thr 
' ioo 1° 5 ±x 

Leu Glu Aj/n Gin Leu Ala Phe Leu Cys Gin Gin Gly Phe Ser Leu Glu 
- L - 120 12b 

Asn Ala/Leu Tyr Ala Leu Ser Ala Val Gly His Phe Thr Leu Gly Cys 
13<y 135 140 

Val Ju Glu Asp Gin Glu His Gin Val Ala Lys Glu Glu Arg Glu Thr 
145 / 150 

Pro Air Thr Asp Ser Met Pro Pro Leu Leu Arg Gin Ala He Glu Leu 
' 165 170 

Phe Asp His Gin Gly Ala Glu Pro Ala Phe Leu Phe Gly Leu Glu Leu 
' iso 185 ■ L =' u 

tie He Cys Gly Leu Glu Lys Gin Leu Lys Cys Glu Ser Gly Ser 
!95 200 205 



